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BIASES IN THE SHANKS-RENYI PRIME NUMBERS RACE 

ANDREY FEUERVERGER AND GREG MARTIN 
UNIVERSITY OF TORONTO 

Abstract 



Rubinstein and Sarnak investigated systems of inequalities of the form ir(x;q,ai) > ■ ■ ■ > 
tt(x; q, a r ), where tt(x; q, a) denotes the number of primes up to x that are congruent to a mod q. 
They showed, under standard hypotheses on the zeros of Dirichlet L-functions mod q, that 
the set of positive real numbers x for which these inequalities hold has positive (logarithmic) 
density 5g ;oi) ... )OT , > 0. They also discovered the surprising fact that a certain distribution 
associated with these densities is not symmetric under permutations of the residue classes aj 

f-H ' in general, even if the ai are all squares or all nonsquares mod q (a condition necessary to avoid 

obvious biases of the type first observed by Chebyshev). This asymmetry suggests, contrary to 
ph prior expectations, that the densities <5q ;ai ,...,a r themselves vary under permutations of the cjj. 

In this paper, we derive (under the hypotheses used by Rubinstein and Sarnak) a general 
formula for the densities Sq ;ai ,...,a r , and we use this formula to calculate many of these densities 
when q < 12 and r < 4. For the special moduli q = 8 and q = 12, and for {a\, a 2 , 03} a per- 
mutation of the nonsquares {3,5,7} mod 8 and {5,7, 11} mod 12, respectively, we rigorously 
bound the error in our calculations, thus verifying that these densities are indeed asymmet- 

,_i- ■ ric under permutation of the a^. We also determine several situations in which the densities 

fiq;ai,...,a r remain unchanged under certain permutations of the <Zj, and some situations in which 
they are provably different. 



Key words and phrases: Chebyshev's bias, comparative prime number theory, primes in arith- 
metic progressions, Shanks-Renyi race. 



1. Introduction and Summary. 

In 1853 Chebyshev remarked that there are more primes congruent to 3 than to 1 modulo 4, 
and since that time considerable efforts have been expended in attempts to determine in 
what sense this remark is true. It follows from the prime number theorem for arithmetic 
progressions (see for instance Davenport ||) that, asymptotically, half of all primes are 
congruent to 3 mod 4 and half are congruent to 1 mod 4, so that Chebyshev's observation 
cannot be interpreted in that sense. However, when we compute the numbers of primes up 
to x that are congruent to 3 mod 4 and to 1 mod 4, we find that for most values of x, the 
primes congruent to 3 are more numerous than those congruent to 1. Similar "biases" have 



also been observed, notably by Shanks ||13| , for moduli q other than 4; in particular, the 
numbers of primes in nonsquare residue classes modulo q tend to exceed the numbers of 
primes in square residue classes. We refer to inequities of this type as "Chebyshev biases" . 
These observations lead naturally to the study of inequalities of the type 

7r(x; q, a\) > n(x; q, a 2 ) > • • ■ > tt(x; q, a r ), (1.1) 

where ir(x; q, a) denotes the number of primes p < x such that p = a mod q. Little- 
wood showed (unconditionally) that the inequalities 7i(x; 3, 1) > 7r(x; 3, 2) and 7r(x; 4, 1) > 

1 
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7r(x;4, 3), as well as the opposite inequalities, each hold for infinitely many integer values 
of x. A number of additional results on single inequalities of this type were subsequently 
derived under certain hypotheses by Knapowski and Turan in a series of papers beginning 
with pi, and Kaczorowski wrote several papers concerning the multiple inequalities ( |1.1| ), 
the most recent of which is |JJ . 

A major advance was made recently by Rubinstein and Sarnak |TIJ who showed (con- 



ditionally) that for any modulus q and for any distinct reduced residues a\,...,a r mod q 
(i.e., integers relatively prime to q), the system of inequalities ( |1.1| ) holds for infinitely many 
integers x. More precisely, they worked under the assumption of the Generalized Riemann 
Hypothesis for Dirichlet L-functions, which we shall abbreviate GRH, and an additional as- 
sumption (their "Grand Simplicity Hypothesis" ) that the imaginary parts of the nontrivial 
zeros of Dirichlet L-functions corresponding to primitive characters are linearly independent 
over the rationals, which we shall abbreviate LI. Rubinstein and Sarnak studied the quan- 
tities 5 q;ai; _ >ar , defined as the logarithmic density of the set of positive real numbers x for 
which the system of inequalities ( |1 . 1|) holds. (Here, the logarithmic density 5(A) of any 
subset A of the real numbers is defined as 



5(A) = lim — *— / 



dt 

x^oo log x J\ n [ 2t x] t 



provided that this limit exists. Suffice it to say here that logarithmic densities are more 
appropriate for these problems than ordinary densities; in this paper, by "density" we shall 
always mean logarithmic density.) 

Under the above hypotheses, Rubinstein and Sarnak proved that the densities 5 g;ai ,...,o r , 
exist and are positive for any integer q > 2 and for any distinct reduced residues oi, . . . , o r 
mod q. They obtained, for several small moduli q, numerical values for the density of those 
x for which the primes up to x that are quadratic nonresidues mod q outnumber those which 
are quadratic residues. Rubinstein and Sarnak also proved that 5 q;a>a i = 5 q]a > ta = 1/2 if a and 
a' are both squares or both nonsquares mod q, and otherwise 5 q;a>a i is greater than or less 
than 1/2 according to whether a or a 1 is the nonsquare mod q, thus bearing out the biases 
of the type observed by Chebyshev. 

It was generally suspected for r > 2 as well that whenever the ctj are all squares or all 
nonsquares modulo q, the densities 8 q - iai ,..., ar are invariant under permutations of the cij (and 
thus equal to 1/r!). However, Rubinstein and Sarnak showed that certain distributions 
Mq;ai,...,ar on R r that are associated naturally with the densities 5 q - ai ,... )ar are not symmetric 
under permutations of the ctj when r > 3, except in the special case when r = 3 and there 
exists p ^ 1 mod q with p 3 = 1 mod q such that a-i = a\p mod q and a^ = a\p 2 mod q. (Note 
that since p = p A mod q is a square, it follows that such {ai,a 2 ,a^} are all squares or all 
nonsquares mod q.) This result suggests, but does not imply, that the 5 q;ai) _ >ar are generally 
asymmetric under permutation of the aj. 

In this paper, we rigorously establish a number of asymmetries of this type. Triples of 
nonsquares and triples of squares occur for the moduli q = 7 and q = 9, but these triples fall 
under the special case that has just been mentioned. Therefore the smallest moduli for which 
such asymmetries of the 5 q - ai ,..., ar could arise are q = 8 and q = 12, each of which has three 
nonsquares (and a single square), and q = 11, which has five squares and five nonsquares. 
Our main theorem provides rigorous results for the cases q = 8 and q = 12, subject to the 
two aforementioned hypotheses: 



°8;3,5,7 


— °8;7,5,3 " 


^8;3,7,5 


— <^8;5,7,3 ~ 


^8;5,3,7 


= ^8;7,3,5 - 


"12;5,7,11 


— Ol2;ll,7,5 


"12;5,H,7 


— Ol2;7,ll,5 


"12;7,5,11 


= Oi2;ll,5,7 
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Theorem 1. Assume GRH and LI. Let £ g;ai ,...,a r denote the (logarithmic) density of the set 
of positive real numbers x for which the system of inequalities (\1.1\) holds. Then 

0.1928013 ±0.000001 

0.1664263 ± 0.000001 

0.1407724 ±0.000001 

and 

0.1984521 ±0.000001 

0.1215630 ±0.000001 

0.1799849 ±0.000001, 

where the indicated error bounds are rigorous. 

The pairwise equalities among the <5's in Theorem 1 are not numerical coincidences, but 
are provably exact. In fact there are several situations in which we can establish symmetries 
of this sort. To state these results, we first need to define 

c(q,a) = -1+#{1 < b<q: b 2 = a mod q} (1.2) 

for coprime integers a and q. Note that when q is an odd prime, c(q, a) simply equals the 
Legendre symbol f - ) . Note further that c(q, a) can take only two possible values for a 
given q: certainly c(q, a) = —1 for every nonsquare a mod q, while c(q, a) = c(q, 1) for every 
square a mod q. We can interpret c(q, 1) as the ratio of the number of invertible nonsquares 
to the number of invertible squares mod q. 

We may now state our results concerning symmetries: 

Theorem 2. Assume GRH and LI. Let q,r > 2 be integers and let ai,...,a r be distinct 
reduced residue classes mod q. 

(a) Letting aj 1 denote the multiplicative inverse of aj modulo q, we have 6 q;aij ___ tar = 

(b) If b is a reduced residue class modulo q such that c(q, aj) = c(q, baj) for each 1 < j < r, 
then 5 qm ,...,a r = o~q;bai,-Mr- ^ n particular, this holds if b is a square modulo q. 

(c) // the aj are all squares modulo q and b is any reduced residue class modulo q, then 

"q;a\,...,a r Oq;bai,...,ba r ■ 

(d) // the aj are either all squares modulo q or all nonsquares modulo q, then 5 g;ai) ... :0r = 

Oq;a r ,...,ai ■ 

(e) If b is a reduced residue class modulo q such that c(q, aj) ^ c(q, baj) for each 1 < j < r , 
then 5g ; ai,...,a r = 8q;bar,---Mi- ^ n particular, this holds if q is an odd prime power or twice 
an odd prime power and b is any nonsquare modulo q. 

The pairwise equalities in Theorem 1 are special cases of part (d) of Theorem 2, which 
generalizes the previously mentioned result of Rubinstein and Sarnak that S q - a ^ a ' = o~ q ; a >,a if a 
and a' are either both squares or both nonsquares modulo q. Their other symmetry result, 
that 5 q] a 1 ,a 2 ,a i is invariant under permutations of the aj when there exists p ^ 1 (mod q) with 
p 3 = 1 (mod q) such that a^ = a\p (mod q) and 03 = a\p 2 (mod q), is also a consequence 
of Theorem 2 (specifically parts (b) and (d), the former applied with b = p and b = p 2 ). 

To complement Theorem 2, we can also establish several inequalities concerning the den- 
sities 5: 
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Theorem 3. Assume GRH and LI. Let q > 2 be an integer, let N and N' be distinct 
(invertible) nonsquares mod q, and let S and S' be distinct (invertible) squares mod q. Then: 

(a) S q -N,N',S > 8q;S,N',N,' 

(b) S q . N> s,S' > $q;S',S,N; 

(c) 5 q - Nt s,N' > $q-,N',s,N if and only if 8 q - NtS > S q;N ^ s ; 

(d) 8q;S,N,S' > $q;S>,N,S if and Only if8 q -S,N > S q - S ',N- 

Parts (c) and (d) of Theorem 3 are further examples that the predisposition towards some 
orderings of {7r(x; q, a±), . . . , n(x; q, a r )} over others cannot be explained solely in terms of 
the Chebyshev bias that encourages nonsquares to run ahead of squares in the prime number 
race. (See also the discussion of "bias factors" in Section 6.) 

The most general result in this paper is an explicit formula for an arbitrary density 
8q;ai,...,a r - Because of the amount of notation involved, we have deferred the statement of 
this result (Theorem 4) to Section 2.5. We have used this general formula to calculate the 
densities given in Theorem 1, and also a number of the £ g;ai ,...,a r in many interesting cases 
involving q < 12 and r < 4. In these additional computations we have not undertaken to 
rigorously bound the error terms; nevertheless we believe, from numerical considerations, 
that the results given in Section 4 are accurate to the number of decimal places indicated. 

We shall assume the hypotheses GRH and LI throughout this paper. In Section 2 we 
provide our main analysis leading to Theorem 4, the general formula for 5 g;01j ... j0r . The 
rigorous bounding of the error terms incurred during the calculation of the densities in 
Theorem 1 is carried out in Section 3. Details of the computations and the additional 
numerical results are collected together in Section 4. The proofs of Theorems 2 and 3 are 
given in Section 5, while in Section 6 we provide concluding remarks, noting some possible 
directions for further work. 



2. Analytic Determination of the Densities 8 qm ,...,a r - 

The goal for this section of the paper is to derive Theorem 4 (see Section 2.5), a general 
formula for the densities S q;ait ... jar . We begin by developing some notation and citing the 
relevant results of Rubinstein and Sarnak in Section 2.1. In Section 2.2 we investigate the 
function p q - ai ,...,a r which will figure prominently in the arguments that follow, while in Section 
2.3 we establish some facts about Cauchy principal values of multidimensional integrals; these 
sections are technical rather than conceptual in nature, and the reader may wish to examine 
these only briefly on the first reading. Because the general formula given in Theorem 4 
and the arguments leading to it are somewhat involved, in Section 2.4 we first detail the 
derivation of this formula for the special cases Ss- a ,b,c and d~i2-,a,b,c occurring in Theorem 1; the 
derivation of the formula in the general case is then carried out in Section 2.5. We assume 
the hypotheses GRH and LI throughout. 

2.1. Notation and Background Results. We begin by establishing the notation neces- 
sary for discussing the results of Rubinstein and Sarnak. For any coprime integers q and a 
and any real number x > 1, define 

log a; 



E(x;q,a) = —=((j>(q)ir(x;q,a) -tt(x)), (2.1) 
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so that E(x; q, a) is an error term for the number of primes congruent to a mod q, normalized 
so as to vary roughly boundedly as x varies. Since the inequalities ir(x; q, ax) > ■•■ > 
7r(:r; q, a r ) hold if and only if E(x; q, ai) > • • • > E(x; q, a r ), we wish to study how often the 
vector 

E q - ai ,...,a r (x) = (E(x; q, ai), . . . , E(x; q, a r f) (2.2) 

lies in the region {(xi, . . . ,x r ) G R r : X\ > ■ ■ ■ > x r }. Notice that if r = 0(g) then the cij 
form a complete set of reduced residues mod q, in which case we see from equation ( [2.1|) that 

log x 

E(x; q,ai)-\ h E(x; q, a r ) = -=<j){q)uj{q) (2.3) 

\/x 



where u>(q) denotes the number of distinct prime factors of q. 

Rubinstein and Sarnak showed, assuming GRH, that the function E q . ai ^^ ar (x) has a lim- 
iting distribution /i g;a i,...,a r , m the sense that 

I rX. dx f f 

J lm 1 V / f( E q;a 1 ,...,a r (x)) — = ■■■ f(Xl,...,X r )dH qm ,...,ar (2.4) 

for all bounded, continuous functions / on R r . Under the further assumption of LI, they 
showed that the distribution fi q - ai ,...,a r is absolutely continuous with respect to the ordinary 
Lebesgue measure on R r . (The exception is the case r = <p(q), when equation ( [2.3|) implies 
that the distribution fj, q - ai ,...,a r is supported on the hyperplane x\ + ■ • ■ + x r = 0; in this case, 
^q;ai,...,a r * s absolutely continuous with respect to Lebesgue measure on this hyperplane.) 
Consequently, the equation ( |2.4| ) holds when / is the characteristic function of any reasonable 
subset of R r (specifically, a measurable subset whose boundary has Lebesgue measure zero 
in R r ). In particular, it follows from the definition of 5 qm ,...,a T that 

<W,..,a r = <5({x e R: 7r(x, q,a x ) > ■■■ > n(x, q, a r )}J 

= ^ q ;a-i,...,a r \\X G K '. X\ > • • • > X r ]\ /^ r\ 

~ ^/^g;ai,...,a r ■ 

xi>—>x r 

Another consequence of the absolute continuity of ii qm ,...,a T is that the set of positive real 
numbers x for which n(x; q, a) = ir(x; q, a') has density zero when a and a' are distinct reduced 
residues; indeed this is even true of the larger set {x: \ir(x;q,a) — ir(x;q,a')\ < $(x)} for 
any function $ such that 

, im -^L- = o. 

x-^oo ^J x j log x 

Next we develop the notation needed to write down Rubinstein and Sarnak's seminal 
formula for the Fourier transform £i qw ,...,a r of the distribution fi q - ai ,...,a r - In this paper we use 
the normalization 

/(6, • • • , £«) = /• • • / e-^*i+~+tr* n ) f ( Xu ..., Xn)dxi ... dXn (2.6) 

for the Fourier transform of an integrable function / on R n , so that the Fourier inversion 
formula (assuming that / is itself integrable) is 

f( Xl , ...,*„) = (2vr)-™ J- ■ ■ J e^+'-'+^/te, . . . , £ n ) ^i • • • «„ • 
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Likewise we write 

A&, ■•-, e») = /•■■/ e -^ i+ - + ^ dp 

for the Fourier transform of a finite measure \i on R r , so that the Fourier inversion formula 
(assuming that fi is integrable with respect to Lebesgue measure) is 

dfi = (2n)- n (J---J e^ xl+ "'^^fi^ 1 , . . . , £„) d& . . . dtiA dxt . . . cte n . (2.7) 

To write down the specific Fourier transform /x g;ai ,...,o r ,, we recall the standard Bessel func- 
tion of order zero, 

^o ( m! ) 4 64 

and then set 

^,x) = II U^z) (2.9) 

7>0 
i(|+J7,x)=0 

in terms of the Dirichlet L-function L(s, x) corresponding to the Dirichlet character x, where 
we have defined 

<*7 = J— - (2-10) 

I + 7 2 



(Since we are assuming GRH, the product in equation (|2.9|) is indexed by all the nontrivial 
zeros of L(s, x) in the upper half-plane.) For later use in numerical approximations of F(z, x) 
we also define the truncated version 

F T (z,x)=( II JoK^))(l + fei^ 2 ) (2.11) 

V 0<7<T / 



0<7<T 
L(|+i7,x)=0 



for any positive real number T, where 



1 



bi=b 1 (T,x) = -T,T—- 2 - (2-12) 

The polynomial factor in the definition (|2.11|) of Ft is motivated by the fact that, in view of 
the power series expansion (2.8) of J , b\ is the coefficient of z 2 in the power series expansion 

of I\^>TJo( a i z )- 

With this notation in place, we can now give the formula [|10 , equation 1.2] of Rubinstein 

and Sarnak for the Fourier transform /2 g;ai ,...,a r of the distribution fiq- ai ,...,a r - They showed, 
assuming GRH and LI, that 



/*«;o 1 ,... I a r (^i,--- > 6-)=exp(z53c(g,o i )Cj) II F [ 



j=i / x m °dq 

x¥=xo 



3=1 



X), 

> (2.13) 



where c(q, a) was defined in equation ( |1.2|) . This result will be used extensively in the sequel. 

Since Jo(0) = 1 we clearly have F(0, x) — Ft(0, x) — 1 f° r an y character %. It is known (see 

for instance the arguments in [|3], Chapters 15-16]) that for a fixed character x-, the number 

of zeros of L(s,x) with imaginary part between and T has order of magnitude TlogT. 
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From this it can be shown that the product (|2.9| ) defining F(z, x) converges uniformly on 
bounded subsets of the complex plane, and hence F is an entire function. For later use we 
will need bounds for the decay rate of F(x, x) and its derivatives F^ N '(x, x) on the real axis; 
this is the subject of the following lemma. 

Lemma 2.1. Given a modulus q > 2 and a nonnegative integer N, there exist positive 
constants /?i and j3 2 such that 

\F (N \x, X )\ <^e~ HA 
for all real numbers x. 

We caution the reader that in the next three sections, the constants (3\ and (3% will not nec- 
essarily have the same values at different occurrences; each statement should be interpreted 
as holding for some suitable positive values of (3i and j3 2 . 

Proof: In this proof we will use the symbol 7, with or without subscript, exclusively to 
denote a positive imaginary part of a nontrivial zero of L(s,x)- We also use T to denote 
an ordered iV-tuple (71, . . . ,jn), and we let mp(7) denote the number (possibly zero) of 
coordinates of T that equal 7. When convenient we can also assume that x > 1, since F is 
an even, smooth function. From the definition ( |2.9| ) of F(z, x), an iV-fold application of the 
product rule gives us the expression 

F {N \x, X )= E »n • • ■ *w II 4 mrm M 

r=(7i,..,7iv) 7 (2 _ 14) 

= 5Xx,r)F(z,x,r) 

r 
for the iVtli derivative of F(x, x), where we have set 

$(x, r) = J] a7 r ^4 mr(7)) (« 7 x) (2.15) 

7er 

and 

7 ^r 



We can show that F(x, x) decays rapidly on the real axis by using the standard bound pO , 
equation (4.5)] 



|J (x)| <min<M, . —— - \ 
{ V k\x\ J 

for the Bessel function on the real axis. This bound implies that 



|F(x, X )|<II min ! 



/y [ V \Jtfy<X> 



,1 



<IL 



, J /I \ 1/4 



7r|a 7i x] jJi V4 



(2.16) 



for any positive integer J, where the 7^ have been indexed in increasing order. Choose 
J = J(x) to be the number of zeros of L(s, x) up to height x/2. For any < 7 < x/2, 
it is easily verified that the factor (7r|x|)~ 1 / 2 (l/4 + 7 2 ) 1//4 does not exceed 1/2. Therefore 
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the upper bound Q2.16Q implies that \F(x, x)\ < 2~ J . Since the order of magnitude of J is 
xlogx, this argument shows that as x tends to infinity, \F(x, x)\ decreases at least as fast 
as a function of the form c xlogx for some constant c depending on x- 

The same conclusion holds for F(x, x, r), since removing the indices j in equation ( j2.16| ) 
for which 7,, e T changes J by at most N and thus does not affect the order of magnitude of J. 
Certainly then there exist positive constants fa and fa (depending only on iV and x) such 
that \F(x, X, r)| < fae~ " /32 ' :c ' for all real numbers x. Since this implies from equation Q2.14| ) 
that 



\F(x,x)\<Pie- p2[xl J2®( x > T ) 



(2.17) 



the lemma will be established (possibly with different values of fa and fa) if we can show 
that this last sum is bounded by some polynomial function of |x|. 

To this end, we employ the crude bounds |Jo(£)| — I an d l^o (^)l — 1 f° r the derivatives 
of the Bessel function, which follow easily from the integral representation 

2 W 2 



Mt) 



n Jo 



cos (t sin 6) d9. 



Again supposing that x > 1, the definition ( |2.15| ) of $(x, T) leads to the bound 



i$(x,m < 



n 

m r(7) =1 



Ct-, Uj 



n 

. 7er 

m r(7)>l 



a 



m r (7) 



It follows that 



$>(x,r) 



<\xfE 



a 



max{mp(7),2} 



< \x\ N N\Y[(l + a 2 1 + a* + --- + a^) 



Since the jth. constant a 7 has order of magnitude l/jj ~ (log j)/j, this last product converges 
to some constant depending only on x- Combining this bound with the inequality ( |2.17p 
establishes the lemma. □ 

Of course it also follows from the first line of equation ( |2.16| ) that \F(x, x)\ is bounded 
above by 1 on the real axis. 

In Sections 3.1 and 3.5 we will need to make use of the fact that fJ, q - au ...,a r can also be 
thought of as the joint distribution of a certain set of r real- valued random variables, and it 
is convenient to exhibit these random variables explicitly at this time. For given values of q, 
r, and ai, . . . , a r , define the vector 

b qm ,-,ar = -(c(g,ai),...,c(g,a r )). 
Next, for any character x mod q, define both the vector 

V q - ai ,...,a r (x) = (x(Ol),...,xK)) 

and the random variable 

X( X )= £ « 7 sin(27r[/ 7 ), (2.18) 

7>0 
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where the a 7 are as in ( |2.10| ) and the £7 7 are independent random variables uniformly dis- 
tributed on [0,1]. Note that by the hypothesis LI, the 7's corresponding to different L- 
functions are distinct, so that a given U 1 only appears in the definition of one of the X(x); 
consequently the random variables {X(x)} are mutually independent. Then Rubinstein and 
Sarnak showed that the distribution p q;ai ,...,a r is in fact the same as the probability measure 
corresponding to the random vector 



J q;a 1 ,.. 



+ X! X (x)Vq; ai ,...,a r (x)- 



Xmodg 

Xt^XO 



2.2. The function p q]ai ,...,a r - in this section we introduce the function p q . a 
which we define by the formula 



R r 



Pq;ai,...,a r (Vl, ■■, Vr-l) = frq;ai,...,a r {Vl,V2 ~ Vl, 



,TJ r -i - r] r -2, -Vr-l) 



(2.19) 



C. 



(2.20) 



so that 



Pq 



«1 , 



■(m 



Vr-lj 



cxp 






c(q, a 



j+i. 



-i=i 



m 



X 



n f \ 



Xinodg 

Xt^Xo 



/ ± 



v., 



X) (2.21) 



from the formula ( p2. 13| ) for p q - ai ,...,a r - We will see in Sections 2.4 and 2.5 that p q . 



ax, 



is the Fourier transform of a certain measure p q: 



in. 



on R r associated with p qm ,.. 



We remark that in the special case where the ctj are all squares or all nonsquares, we have 
c(q, a±) = ■ ■ ■ = c(q, a r ) and so the exponential term in the formula (2.21) is identically 1, so 
that p q - ai ,...,a r is real- valued and symmetric with respect to reflection through the origin. 

The function p q - ai ,...,a r will feature significantly in the remainder of this paper, and it will 
be important to establish some of its smoothness and decay properties. To avoid frequent 
repetition of the same properties, we shall say that a function / on R n is well-behaved if 
it has continuous derivatives of all orders and if there exist positive constants (3\ and fa 
such that, for every subset {ji, . . . ,jk} of {1, ... , n}, the mixed partial derivative — '~ 
satisfies the inequality 

Qkf 



dxj ...dx 



:>k 



OXj 1 . . . OXj k 



[Xl, 



• x n ) 



<fae 



-MA 



(2.22) 



where ||x|| = ||(xi, • . • , x n )|| = Jx\ + • • • + x\ is the Euclidean norm of x. This criterion 
must also be satisfied for the empty subset of {1, . . . , n}, so that the actual values of / must 
also be bounded by the right-hand side of ( [2.22J ). Certainly any well-behaved function is 
integrable as well. We remark that all of the functions shown to be well-behaved below in 



fact satisfy an inequality analogous to ( 2.22 ) for partial derivatives of all orders; however 
our proof of Lemma 2.4 below only requires this assumption on the mixed linear partial 
derivatives. 

It is easily seen that finite sums and products of well-behaved functions are again well- 
behaved. If / and g are well-behaved functions on R m and R n , respectively, then fg is a 
well-behaved function on R m+n ; conversely, the restriction of a well-behaved function on R™ 
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to any subspace denned by setting certain variables equal to zero is a well-behaved function 
on that subspace. Also, if L: R m — > R n is an injective linear map and / is a well-behaved 
function on R n , then the composite function / o L is a well-behaved function on R m : the 
partial derivatives of / o L will just be linear combinations of the partial derivatives of /, 
and the fact that L is injective means that ||L(x)|| is bounded below by a constant multiple 
of ||x||, so that the estimate fl2.22| ) for / on R™ can be converted to a similar estimate for 
/ o L on R m 

The following two lemmas establish the important fact that the functions p q - ai ,...,a r are 
well-behaved. 



Lemma 2.2. For every subset {ji, . . . ,jk} of {1, . . . ,r}, 
d k 



dxj l . . . dx 



jk 



3=1 



X) < Pi exp -fc 



J2x(a 3 )xj 



i=i 



(2.23) 



for some positive constants fi\ and $2- 
Proof: Define 



G(xi 



X r'i X) 



J2x{aj)xj 

3=1 



■X 



The argument of F on the right-hand side of this definition involves a modulus and hence 
implicitly a square root, which could potentially cause discontinuities in the derivatives 
of G when this argument equals zero; however, the Bessel function J is even, whence the 
function F(x, x) involves only even powers of x in its power series expansion about the origin. 
Consequently, G has continuous derivatives of all orders. Note also that it suffices to establish 
the upper bound ( [2.23J ) when \YHj=iX{ a j) x j\ > 1) since the bound on the complementary 
set follows immediately from continuity (with some value of /?i). 

If we write F(x,x) = F(J\x\,x), then it is easy to check by induction that the nth 
derivative of F equals 



F (n \x, X ) 



n 

fc=l 



F (k)f 



x\,x \x 



-n+j/2 



for some constants a 



n,k' 



In particular, when \x\ > 1 we see from Lemma |2.1| that 



(2.24) 



for some positive constants (3\ and /3 2 . 
In this notation we have 



G(x 



i- 



x ri x) 



3=1 > 



Im^x(ai)^ 

3=1 ' 



X 



Suppressing the details, we note that the mixed partial derivative dx d G dx 



can be computed 

using the product rule as a combination of three types of expressions: derivatives of F eval- 
uated at | Y7j=x x( a j) x j\ 2 j linear factors of the form 2 Re(x( a k) X^=i x{ a j) x j)i an d constants 



of the form 2Re(x( a k)x( a k'))- From equation (|2.24j ), the expressions of the first type can 
be bounded above by /?iexp(— j3 2 \ Y7j=i x( a j) x j\)i while the expressions of the other types 
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grow only as fast as a polynomial in | YZ=i x( a j) x j\- This establishes the lemma for suitable 
positive values of (5\ and Pi- D 



Lemma 2.3. The function p qm ,...,a r is well-behaved for any integers q, r > 2 and any distinct 
reduced residues {at, . . . , a r }. 



Proof: From the formula (|2.13|) , the function jx q;ai ,...,a r certainly has continuous derivatives 
of all orders (see the proof of Lemma |2.2p , and thus the same is true of p g - ai ,...,a r - We 
begin by examining the behavior of the mixed partial derivatives of the function fi q - ai ,...,a r - 
Let S = {ji, . . . , jk} be a subset of indices from the set {1, . . . , r}, and let ^— denote the 
result of taking the partial Xj-derivatives for every j in S. The product rule applied to the 
formula ( [2.131 ) for £iq ]ai ,...,a r yields 



dx- 



~Pq;ai,...,a r {<,) / , 



So,{Sx} 



Qk / r \ Qk 

exp Kcfeo^ Yl a F \ 



i=i 



Xinodq 

Xt^XO 



E *(%)& 



X 



5.25) 



where the outer summation is taken over the finitely many partitions of the index set S 
into S U (U x ^ X o S x ). Each mixed partial derivative of the exponential term is bounded, 
while from Lemma [2.2| each mixed partial derivative of F(\ Xa=i x( a j)£j\i x) is exponentially 
decaying as a function of its argument. We conclude from equation ( |2.25| ) that there exist 
positive constants Pi and p 2 such that 

Qk / 

As;oi,...,a r (0 <Pl II exp -^ 2 



dx s 
where we have defined 



\modq 

Xt^XO 



E X{a r )$,j 

3=1 



Pie 



-foQ(0 1/2 



(2.26) 



Q(0 = Q q -, ai ,..,a r (0 = ( E 



\ xmodg 

Xt^XO 



Ex(%)6 



We thus seek a lower bound on Q(£). 
We may certainly write 



Q(0> E 



Now 



E 

Xmodtj 



Xmodg 

Xv^XO 



E x(ai)& 



r 


2 


ExK)^- 


= E 


i=i 


Xmodg 



E *(%)& 






EE&& E X(a*)x(< 



«=1 j;'=l xmodg 

by the orthogonality of the characters \. Therefore 



£3 



Q(0 >#?)££- £ 6 



i=i 



-i=i 



(2.27) 



We assume for now that r is strictly less than 4>(q), commenting at the end of the proof 
on the slight differences in the case r = <p{q). The quadratic form on the right-hand side of 
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the inequality (|2.27|) turns out to be positive definite when r < <p(q), and so we can write 

Q(0><P(q)KU\\ 2 , (2.28) 

where A r is the smallest eigenvalue of that quadratic form. From the inequalities Q2.26| ) 
and (|2.28|), it follows that 



AW,-r(0 <Pie~ M 



dx s 

for some different positive constants /?i and /3 2 . Since the index set S C {l,...,r} was 
arbitrary, this shows that the function p q;ai ,...,a r is well-behaved. 

Furthermore, from its definition (|2.20|) the function p qm ,...,a r is simply the composition 
of fi q - ai ,...,a r with the injective linear transformation (7/1, ... , r} r -i) 1— ► (r]i, 772 — r)i, ■ ■ ■ , Vr-i ~ 
r] r -2, — T7r— 1) from R r_1 to R r . As mentioned before, this implies that p q]ai ,...,a r is itself a 
well-behaved function. 

When r = <f)(q), the function p q - ai ,...,a r is invariant under translation in the direction of 
the vector (1, . . . , 1), and so it is not well-behaved even though it has the required decay 
properties on the hyperplane orthogonal to (1, . . . , 1) (one can check that the quadratic 
form on the right-hand side of the inequality ( |2.27|) is positive semi-definite when r = <p(q), 
with its zero set being the multiples of the (1, . . . , 1) vector). However, the image of the 
linear transformation (r/i, . . . , r] r -i) *- ► (t)i, t]2 — Tji, . . . , r? r _i — r] r -2, —Vr-i) lies within this 
hyperplane, so we can still deduce that p q - ai ,...,ar i s well-behaved even when r = 0(g). This 
establishes the lemma. □ 

Of course we also have the trivial bound \p q;ai ,...,a r \ ^ 1- Lemma |2T3| implies in particular 
that p q - ai ,...,a r is integrable, and consequently the Fourier inversion formula (|2.7| ) is valid for 
Pq;a u ...,a r , becoming 

d Pqm ,...,a r = (27r)" r ([■■■[ e i ^^ + - + ^ X ^p qm ,..., ar ^ 1 , ...,&.)#!... d£r) ^1 • • ■ dx r . 

\ J J J (2.29) 

2.3. Multidimensional Cauchy principal values. In one dimension, the Cauchy princi- 
pal value 

P.V. r IM- dx = lim / f -^dx 

J-oo X e-*0J|a;|>e X 

is a familiar object. For our purposes it will be necessary to make use of the multidimensional 
analogue 

P.V. / • • • / — — dx\ . . . dx n = lim / • • • / — — dx\ . . . dx n ; 



' ' Xl '-- Xn e ~*\in { U-k\}>e Xl '-- Xn (2.30) 

in particular, we would like to know that this limit exists. The purpose of this section 
is to establish the existence of these multidimensional Cauchy principal values for well- 



behaved functions, a class which by Lemma |2T3| includes the functions p q - ai ,...,a r discussed in 



the previous section. We remark that while the lemmas in this section could certainly be 
obtained under somewhat weaker hypotheses, they suffice for our purposes as stated. 
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Lemma 2.4. Let f be a well-behaved function on R n that vanishes whenever any of the 
first k coordinates x\, . . . , Xk equals zero. Then the function f(xi, . . . , x n )/x\ . . . x k extends 
across the coordinate hyperplanes to a continuous integrable function satisfying the upper 
bound 

f{xi, 



x r , 



Xi...X k 



<Pie 



-fo\\x\\ 



(2.31) 



for some positive constants (3\ and ft 2 . 



Although this lemma holds in one dimension without any assumptions on the derivatives of 
/, already in R 2 one can construct an exponentially decaying, smooth (even real-analytic) 
function f(x,y) that satisfies f(0,y) = for all y but for which f(x,y)/x is not integrable. 

Proof: The fact that f{x)/x\ . . . x k extends across the coordinate hyperplanes to a contin- 
uous function follows from the fact that / has continuous derivatives of all orders; therefore 
only the upper bound ( |2.31| ) remains to be proved, since integrability is a consequence of 
this bound. Furthermore, by continuity it suffices to establish this upper bound when none 
of the variables equals zero. Also, if all of the \xj\ are bounded by 1 then the function 
f(x)/xi . . . Xk is uniformly bounded; therefore we may assume (after inflating the constant 
j3\ if necessary) that there exists an Xj with \xj\ > 1. 

Permuting the first k variables if necessary, we can choose an integer 1 < m < k such that 
< \xx\, . . . , \x m \ < 1 and |x m+ i|, . . . , \xk\ > 1. Since / vanishes when x\ equals zero, there 
exists a number t\ with |ti| < |a?i| such that 

df 

f(Xi, ...,X n ) = f{Xl, ...,X n )~ /(0, X 2 , • • • , X n ) = X 1T —(t 1 ,X 2 , ...,X n ) 

OX\ 

by the mean value theorem in the variable x\. Similarly, / vanishes whenever x 2 equals 
zero, so in particular -^- equals zero when x 2 = 0. Therefore, there exists a number t 2 with 
\t 2 \ < \x 2 \ such that 



df_ 

dxi 



(ti,X 2 , . . . ,x n ) 



21 

dx\ 



(ti,X 2 , . . . ,x n ) 



df_ 

dx\ 



(ti,0,x 3 ,...,x n ) 



x 2 - 



d 2 f 



■(ti,t 2 ,X 3 , 



' dx\dx 2 

by the mean value theorem in the variable x 2 . Continuing in this way, we find numbers tj 
with \ti\ < \xA for each 1 < i < m such that 



f(xi 



%r, 



It follows immediately that 



Xl 



< 



X r 



d m f 



dx\ . . . dxj 



it, 



' ^"rtii %m-\-li 



i 3>r, 



d m f 



dxi . . . dx r 



- \Tl, ■ ■ ■ , <J mi Xra-^x 



i Xr 



x x ...x k 

since \x m+1 \, . . . , \x k \ > 1. 

Since / is well-behaved, there exist positive constants (3\ and j3 2 such that 

d m f 



(2.32) 



dx\ . . . dx r 



(tt, ...,t m , x m+1 , ...,x n ) < (3 X exp (-/5 2 ^i H h t 2 m + x m+i H 1- x 



(2.33) 
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But notice that 

t x + • • • + t m + x m+l + • • • + x n > 2^ x i — ~ Z^ x j • 

i<i<« i=i 

|x,|>l 

Since we are working under the assumption that at least one of the \xj\ exceeds 1, we can 
use this fact in the inequality Q2.33D to see that 

-(t, t r ,1 t) < /3 lP -#2lMIA/^ 

\ts\, . . . , <j m i -^m+l; • • • i • L "n) _ Ml° 



9xi . . . dx r 

Combining this bound with the inequality Q2.32p , this establishes the lemma (upon replacing 
fo/y/nbyfc). D 

For the proof of the next lemma, as well as for the formulation of the general formula for 
^;ai,...,ar (Theorem 4 in Section 2.5), we require the following notation: for a function / on 
R n and a subset B of {1, ... , n}, define 

f(B) = f{B){{ Xj : j e B}) = f(9 h . . . A) (2-34) 

where 9j = Xj if j G B, and 9j = otherwise. For example, if n = 6 and -B = {2, 4, 5} then 
f(B) is a function of the three variables x 2 , x 4 , and x 5 , namely f(B) = f(0,X2,0,x 4: ,x 5 ,0); 
in general f(B) will be a function on the appropriate \B\ -dimensional subspace of R n , where 
\B\ denotes the cardinality of B. In the case B = we simply have f(B) = /(0, . . . , 0). 

Lemma 2.5. If f is a well-behaved function on R n , then 

P.V. /".../" /(X1 '---' X " ) dan . . . dx n 



is well-defined; i.e., the limit in equation \2. 3(\ ) exists. 

Proof. Let g\{x) be an even, well-behaved function on R 1 with g'i(O) = 1 (for instance, 
we might have in mind g\{x) = e~ x ), and let g(x\, . . . ,x n ) = gi(xi) . . .gi(x n ). Define an 
operator G on well-behaved functions / by 

G(f) = G(f)( Xl ,...,x n )= £ (-l) n -Wf(B)g(B) (2.35) 

Bc{l,...,n} 

in the notation of equation ( p.34 ), where B denotes the complement {1, . . . ,n}\B of B. 
Since / and g are well-behaved functions, the same is true of G(f). 

Consider the term in ( |2.35| ) corresponding to some particular proper subset B of {1, ... , n}. 
If we choose £ ^ B, then the term f(B)g(B) can be written as g(xe) times a function 
independent of X£. Thus f(B)g(B) is an even function of x#, and hence integrates to zero 
against any odd function of xe- In particular, 

f(B)g(B ) 

iXj ]_ • • • d-"F 

min{|xi|,...,|a; n |}>e 

for any positive e and any proper subset B of {1, . . . , n}. Since the term in the sum ( |2.35| ) 
corresponding to B = {1, . . . , n} is simply the function / itself, we see that 



/ mm dXl ,„ dXn = 

J Jb 1 • • • i//T" 



71 ■ dx\ . . . dx n = / ■ ■ ■ / dx\ . . . dx n 



min{|zi|,...,|a;„|}>e min{|a;i|,...,|a;„|}>6 \L.6K>) 
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for any e > 0. 

On the other hand, we claim that G(f) evaluates to zero when any of the variables xt 
equals zero. To see this, let B be a subset of {1, ... ,71} not containing I. When %i = 
we see that the term (— l) n ~\ B \ f (B)g(B) corresponding to B in the sum (|2.35| ) reduces to 
(-l) n ~\ B \f(B)g(B \ {£}). On the other hand, the term 



(-lf-l Bu WI/(5 U {£})g(B U {£}) 



■1 



,n-l-|B| 



f{BU{t})g{B\{£}) 



corresponding to B U {£} reduces to (— l) n 1 \ B \f(B)g(B \ {£}) when X£ = 0. It follows that 



when xi = 0, the terms in ( |2.35| ) will cancel pairwise in the natural pairing between the 
subsets of {1, . . . , n} not containing P. and those containing t. 

Because of this, Lemma ^\ tells us that the function G{f)/x\ . . . x n is integrable, whence 
the dominated convergence theorem implies 

G(f) 



lim / ' " " / dx\ 

e->o J J Xi . . . x n 

min{|xi|,...,|x„|}>e 






JU \ • • • J- 1 X, 



dx\ . . . dx r 



(2.37) 



This together with equation (|2.36|) shows that the principal value ( p.30|) exists — in fact it 
equals the integral on the right-hand side of equation |2.37| . □ 



Lemma 2.6. If f is a well-behaved function on R n ; then for any 1 < k < n, 

n-k I I J{ x li---i x n)Xl-..Xk 



lim c v 



R" (c z + xi 



TV 



(c 2 + x\ 

n-k p -y 



dx\ . . . dx r 



f(x 1 ,...,x k ,0,...,0) 



R k 



Xi...X k 



dxi . . . dxk- (2.38) 



Proof. We proceed along lines similar to the proof of Lemma |27|. Analogously to the 
definition Q2.35D of the operator G(f), define the operator 

G k (f) = G k (f)( Xl , ...,!„)= £ {-l) k -\ B \f{B U {k + 1, . . . , n})g(B), 

Bc{l,...,k} 

so that G k (f) is itself a well-behaved function. The arguments leading to the validity of 
equation (|2.36|) in the proof of Lemma |2.5| show that the function G k (f) — f integrates to 
against any function that is odd in each of the variables xi, . . . , x k separately. In particular, 

J \3>li • • • j •Kn)-Kl ■ ■ ■ %k 



„n—k 



(c 2 + x\) . . . (c 2 + X 2 ) 



dx\ . . . dx r 



„n—k 



G k (f)(x 1 ,...,x n )x 1 ...x k 
(c 2 + a; 2 ) 



.-, .-. ax 1 . . . ax n . 
[c + £„) 

Making the change of variables Xj = c^j for k < j < n and rearranging terms, we see that 
n _ fc f [G k (f)(x 1 ,...,x n )x 1 ...x k 



(c 2 + xj) 



(Jj*jU~\^ m m . (JjJjYl 



(c 2 + x 2 nJ 
G k {j){Xi, . . . , x k , cx k+ i, . . . , cx n )x 1 . . . x k 



(c* + xi)...(cz + xl)(l + xl +1 )...(l + xl) 



dx\ . . . dx n , (2.39) 
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where we have defined 

G k (f)(x) 



G k (f)(x) 



Xi . . . x k 



As in the proof of Lemma |2.5| , we can check that G k (f) evaluates to zero whenever any of 
the first k variables equals zero, and thus by Lemma |2.4| the function Gk(f) is continuous 
and integrable and satisfies an upper bound of the form 

|G*(/)(z)|</3 ie -fclMI (2.40) 

for some positive constants /3i and /3 2 - 
Now define 



b c [X\, . . . , Xk 

max 

\t h +i\,-,\t n \<VB 



fte-a*!!, if ||x|| > 1/y/c, 

Gk(f)(xi, ■ ■ ■ ,Xk,tk+i, ■ ■ ■ ,t n ) 

it ||x|| < 1/y/c. 



l + xi +1 )...{l + xi 



One can check that the integrand on the right-hand side of equation (2.39) is bounded 
in absolute value by S c {x\, . . . ,x k ) when < c < 1. Moreover, the continuity of G k {f) 
implies that S c is bounded on the set {x G R fe : ||x|| < 1/y/c}, and therefore S c is integrable. 
Furthermore, both S c and the integrand on the right-hand side of equation (2.39) tend 
pointwise to the function 

G k (f)(x 1 ,...,x k ,0,...,0) 

(l + xl +1 )...(l+xl) 

as c tends to zero, and this function is also integrable by the exponential decay |2.40] of 
Gk{f)- Therefore, taking limits on both sides of equation (2.39) and using the generalized 
dominated convergence theorem, we conclude that 

c™+ C J J ( C 2 + X?)...(C 2 + X2) aX ^-- aX " 



f G k (f)(x 1 ,...,x k ,0,...,0) 
-J (l+xl +1 )...(l + xl) dXl '--^ 

n n- k f.. fG k (f)(x 1 ,...,X k A...,0) dxi ^ 

J J X\ . . . x k 



But just as in the proof of Lemma |2.5|, this last integral equals the principal value of the 



integral of f(x\, . . . , x k , 0, . . . , 0)/xi . . . x k , which establishes the lemma. □ 

Of course, the lemma would also hold if both occurrences of the product x\ . . . x k in equa- 
tion (2.38) were replaced by any product Xj x . . . Xj k of k distinct variables (and the variables 
of integration on the right-hand side adjusted accordingly). 

2.4. Analysis for the special case. In this section we derive analytic expressions for the 
density <5s ; 3,5,7 (the logarithmic density of the set {x G R: n(x; 8, 3) > n(x; 8, 5) > 7r(x; 8, 7)}) 
and for the other densities in Theorem 1. These formulas are special cases of Theorem 4, 
which will be established in the next section; however, we present a complete analysis in 
these special cases to illustrate and motivate the techniques in the general case. 
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We begin by noting the special case 



<5 8 ;3,5,7 = J J J d H8;3,5j(x, V, z) 

x>y>z 

of equation Q2.5|) . Making the change of variables u = x — y, v = y — z, and w = z gives 

^8;3,5,7 = J J J dV8fl fi) r{u,V,w), 

«>0«>0toeR 

where the measure v qAl ,...,ar is defined, in obvious notation, by 

^8;3,5,70, V, w) = /i 8; 3,5,7(« + V + W,V + W,w), (2.41) 

or equivalently p,s-,3,5 77 (x,y,z) = ^8;3,5,7(^ — y,y — z,z). Integrating out the w variable, we 
obtain 



<W,7 = / / dp 8 . 3A7 (u,v), (2.42) 

u>0v>0 

where we have defined, again in obvious notation, 



ps;3,b,7(u,v) = J du Bia>s ,r(u, v, w) . (2.43) 

It is easily checked that the Fourier transform of ps;3,5,7 is related to that of /X8;3,5,7 via 
P8;3,5,7(£, v) = A8;3,5,7(£, V — £, — v)i which is a particular case of equation ( |2.20| ). 

We can appeal to the formula (2.21) for p q;ai ,...,a r to write P8;3,n, 7 (£, v) explicitly. Recall that 
a discriminant is an integer congruent to or 1 mod 4, and a fundamental discriminant D is 
an integer that cannot be written in the form D = dn 2 for some discriminant d and integer 
n > 2. For any fundamental discriminant D, let xd denote the character Xd( u ) — ( ) using 
Kronecker's extension of the Legendre symbol (see Davenport H Chapter 5]). Then the three 
nonprincipal characters mod 8 are simply x_ 8 , %_ 4 , and Xs- I n this setting, equation (2.21) 
becomes 

/W(£, V) = F(\2£\,x-a)F(\2r} - 2£|, X _ 4 )F(| - 2t/|, Xs ), (2.44) 

showing that P8;3,5,7 is real-valued and symmetric with respect to reflection through the ori- 
gin. The same argument gives formulas for S g . at b, c for any permutation {a,b,c} of {3,5,7}, 
where the arguments of the F(-,x) functions in equation (|2.44| ) simply are permuted ac- 
cordingly. Since each F(z, x) is an even function, we can omit the absolute value signs in 
these arguments. Similar remarks hold for the modulus 12, where the three nonprincipal 
characters are X-4, X-3; an d Xi2- 

Using the monotone convergence theorem and the Fourier inversion formula (|2.29|) , equa- 
tion ( 2.42 ) becomes 

<5 8;3 , 5 ,7 = c hm J J e-^ u+v Up 8 , 3 ^ 7 (u,v) 

u>0v>0 

Jim J J e -« u+ ^ {^ J J e^ + ^p 8 . 3 , 5 , 7 (Z, V )dt;d V } dudv. 



u>0d>0 
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We next use Fubini's theorem to write 



^8:3,5,7 =: — — lim 



PS;3,B,7^V)i I I e<~ C+i ^<-^ dudv) <%d V 



4tt z c^o+ J J J 7 I 

k u>0»>0 



4tt 2 c-+o+ 7 7 (c — i£) (c — w/) 

i hf/tofejI^^-W^ < 2 - 45 > 



4tt 
where we have defined 



47r 2 c _ +7 7 (c 2 + £ 2 )(c 2 + 77 2 ) 

2 (^8;3,5,7 + ^8;3,5,7 — ^8;3,5,7)> 



G »*« = jsa. c2 / / ^ff#^) **> • (2 ' 46) 

^3,5,7 = c hm C y y (c2 + a(c2 + //2) ^, (2.47) 



and 



8;3 , 5 ,7 = lim / / 2 ' V 2V ' 2 ^^ ^ d V ■ ( 2 - 48 ) 



P8;3,5,7(^^)^ 
c^0+ J J (c 2 + £ 2 ) (C 2 + 7] 

In equation (|2.46|) we make the change of variables a = £/c and (3 = rj/c to obtain 

P8;3,5,7 (ca } cf3) f f p 8 ;3,5,7(0,0) 2 



^ 3 < 5 ' 7 = J& 7 J (l + a 2 )(l + /3 2 ) ^ = 7 y (l + a 2 )(l+/? 2 ) M/5 = * 

where we have again used the dominated convergence theorem together with the trivial 
bound |p 8;3; 5 5 7(^', 77) I < P8;3,5,7(0, 0) = 1. Next, we note that H 8 . 3j57 equals zero since the 
integrand in equation ( |2.47| ) is odd under reflection through the origin. Finally, we observe 
that equation (J2.48 ) may be written as 

(P8;3,5,7(6, V) - P8;3,5,7(C, 0)/3 8;3 , 5)7 (0, 7/)) £/? 



t ^• \r 0,0,0,1 v."s ) / ro,o,o,i \^i " /ro,o,o,i \"i ■ / / >< jt- j 

Jw = c™J J ( c 2 +a(c 2 +^) d ^ 

since the term introduced is odd in either variable separately and so integrates to zero. This 
is the same as 

T V f f P8;3,5,7(^, V) - P8;3,5j(£, 0)P8;3,5,7(0, V) gV ACA 

h ^ = }^J J l v (c 2 + a(c 2 + ^) didv - 



Note that the expression Ps;3,5,7(£> v)~P8;o,5,7(£, 0)/3 8; 3 5j7 (0, rf) is well-behaved by Lemma p73 



and since ps ; 3,5,7(0, 0) = 1, it evaluates to zero when either £ or 77 equals zero. Therefore, 
the first fraction in the integrand can be extended across the £ and 77 axes to a continuous 
integrable function by Lemma |2.4| . We may thus use the dominated convergence theorem to 
see that 

J 8 . 3 5 7 = / / ^,5,r(e^)-P8 ; 3,5,r(e,0)P8;3, 5 ,r(0^) ^ ^ 

7 7 £77 

Note that this integral may be written as the multivariate Cauchy principal value 

W = P-V./ J ^0^ d^dr, (2.50) 
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as discussed in Section 2.3, since Ps;3,5,7(£> 0) and ps;3,5j{0, v) are even functions and hence 
the term omitted in passing from ( 2.49|) to ( |2.50| ) is odd in either variable. (Of course, we 
could have arrived at ( |2.50| ) directly from the definition of is;3,5,7 by invoking Lemma 2.6; 
however, not only is this derivation more concrete, in keeping with the spirit of this section, 
but we will also need the formula (|2.49|) during our error analysis in Section 3.) 



It follows that the right-hand side of equation ( |2.45| ) can be evaluated to give 



d; 



8:3,5,7 



1 
4 



( 8;3,5,7 



1 

4 



1 



P.V. 



P8;3,5,7(£,V) 



d£drj. 



(2.51) 



4rr 2 °'°' d '' 4 An 2 ' J J ^ 

where Ps ; 3,5,7 is given explicitly in equation ( [2.44|) . The identical argument, of course, applies 
for evaluating 5s- a ,b,c for any permutation {a, b, c} of {3, 5, 7} to yield 

i Lpv f /" fegjip fc *?) 



"8;a,6,c 



d£drj, 



4rr 2 ' J J £r] 

where ps-, a ,b,c is defined via obvious analogy to ( |2.41| ) and ( j2.43| ), and similarly 

1 1 py / f Pl2;a,b,c(£,V) 



(2.52) 



>12;a,b,c 



4rr 2 



tv 



d£di], 



(2.53) 



for any permutation {a, b, c} of {5, 7, 11}. 

We remark that the numerator of the integrand in (|2.49|) may be viewed as a "measure of 
dependence" in the Fourier domain for the bivariate distribution of a random vector (X, Y) 
in R 2 having density ps;3,5,7- in fact, the integrand in equation ( |2.49| ) is the Fourier trans- 
form of the natural dependence measure based on factorizability of the bivariate cumulative 
distribution function corresponding to Ps;3,5,7- This interpretation is important in Section 
3.1, where a random vector (X, Y) of this type is analyzed to yield bounds for the tail of the 
measure p 8;3 ,5,7- 

2.5. Analysis for the general case. We are now at the point where we have the notation 
and tools needed for the statement and proof of a general formula for the densities 8 q;au ,„ >ar . 

Theorem 4. Assume GRH and LI. Let q, r > 2 be integers, and let a 1; ... , a r be distinct 
reduced residue classes mod q. Then 



S. 



g;ai,...,a r 



2 -(r-U 1 + 



E 

Bc{l,...,r-1} 
B^0 



\B\ 



P.V. 



Pq: 



01, 






(2.54) 



where p q . ai 

Pq;a 



r (B) uses the notation of equation $2.34 ) applied to the function 
i,...,a r (Vi, ■■■, Vr-x) = exp f ^2(c(q, aj) - c(q, %+i))^- J 



x 



n ^i 



Xmodg 

x¥=xo 



r-l 



EM a i) -x(aj-i))Vj 



i=i 



X 



Proof: We follow the strategy used for the special cases in Section 2.4. Our starting point 
is equation Q2.5p, 



0q;ax,...,Or — ^pq;ai,...,a r 



Zl 



, . . . , X r 



x\>X2>--->x r 
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We make the change of variables u\ = x% — x%, ... , u r -\ = x r _i — x r , u r = x r to obtain 

S qm ,...,ar= /'"/ ^g;ai,...,a r («l, • • ■ , «r) , 

U1>0, ..., « r -l>0 

u r £R 

where the measure f q - ai ,.. ., ar is defined, in obvious notation, by 

Vqm,...,ar{u>U ■ ■ ■ ,U r ) = fiq;a U ...,ar(Ul H h M r , M 2 H h W r , . . . , U r ) , 

or equivalently jU g; a 1) „. I a r 0£i, ■ ■ -,x r ) = v q - ai ,...,a r {^i - x 2 , . . . , av-i - x r , x r ). Integrating out 
the u r variable leads to 

8q;a u ...,ar = / ••" / ^Pg;a lv ..,a r («1, • • • , Mr-l) , (2.55) 

«i>0 u r _i>0 

where we have defined (again in obvious notation) 



Pq;ai,...,a r (Ux,---,U r - 1 )= / du q . aij _ )(lr (u U . . . , W r _i, v) 



ueR 



It is easily checked that the Fourier transform of p q;ai ,...,a r is related to that of p q - ai ,...,a r by 
the identity (^20|) . 



At this point, our goal is to evaluate the integral on the right-hand side of equation Q2.55|) 



in terms of the Fourier transform p q - ai ,...,a r of p qm ,...,a T . The correct final formula could be 
obtained by writing this as the integral of dp q;air __ tar against the characteristic function of the 
region of integration, and using Parseval's identity in the context of the theory of generalized 
functions; the following analysis derives this final formula rigorously. 

Using the monotone convergence theorem followed by the Fourier inversion formula ( |2.29| ) . 
equation ( |2.55| ) becomes 



<W..,a r = c hm + / • • • / e «(«i+-+«r-i) dp q , ai _ ar {u 1} . . .,Ur- X ) 

Ml>0 U r _l>0 

= \im j ■■■ f e - c («i+-+«r-i)|(2 7r )-(»- 1 ) 

m>0 u r _i>0 ^ 

x J.-.J e i ^ + --- +u <--^-^p qm ,..., ar (Z 1} . . . , &-i) d&... d£r-i\ du x ... du r ^. 
Then by Fubini's theorem this becomes 

$q m ,...,a r = ( 27r ) _(r_1) c 1 i™ f _/■•■/ /3«ai,...,ar(£l,...,6-l) 

xl j ... f e «i(-c+^)+-+«-i(-c+^_i) ^ c/ ^_ i I d ^ rf ^_ i 

= (2 .)-<-) i,m /. . . / &= -«--"-^-')< c +!f i '--; (c+ ^ l) «. - *-. 

c ^°+ V J (c 2 + £f ) . . . (c 2 + £?_i) 
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and expanding the product (c + i£i) . . . (c + i£ r _i) leads to 

<W.,«r = (Svr)-^ £ ^' W-,«r( S ). ( 2 - 56 ) 

Bc{l,2,...,r-1} 



where we have defined 

I qm ,..., ar (B) ^ hmc iy.-y (02 + ^)...^ + ^) «i-«-i 

Appealing to Lemma 2.6 with n = r — 1 and fc = \B\, we see that 

drjj 



I,, au ..., ar (B) = ^-\*\p.V. /-.. J p qm ,.,., ar {B) n ^ 



which includes the special case I g;ai ,..., ar (0) = ?r r *■ Using this fact in equation ( |2.56| ) estab- 
lishes the theorem. D 

We remark that the measure Pq- ai ,...,a r is actually the limiting distribution of the vector 
■(vr(a;;g,ai) - 7t(x; q, a 2 ), . . . ,ir(x; q, o r _i) -vr(x;g,a r )) 



in R r ™ 1 , so its usefulness to the investigation of those x with 7r(x; g, ai) > • • • > 7r(x; g, a r ) 
is not surprising. 

To conclude this section, we consider two special cases of Theorem 4. In the case r = 2 (in 
other words, when we are comparing simply a pair a%, a 2 of residues modulo q) the formula 
( [2.54J ) reduces to 

6 q ., aua2 = -[l + -P.V.J j 

1 1 /■ sin({c(o, ai) — c(o, 02)}??) i-r „ ,, , x / m \ , 
= 2-^j — U FxdxiaJ - xMlv) d V , (2.57) 



Xmodg 



the corresponding cosine term in the last integral being omitted by virtue of symmetry. When 
c(q, Oi) = c(g, 02), the integrand is identically zero and hence S q - aua2 = 1/2, as was proved 
by Rubinstein and Sarnak. In fact, the formula Q2.57|) is analogous to their formula |10| , 
equation 4.1]. 

In the case r = 3, Theorem 4 becomes 



Vq;ai,a2,a3 T + T ir.V. / [Pq- } a 1 ,a2,a 3 {Vi^ ) ~1~ Pq;ai,a 2 ,a 3 {V-i V ) ) 



1 P.V. f [p^^fa^)^^-. (2.58) 



47T 7 7 r]ir) 2 

If the aj are all squares or all nonsquares, then the one- dimensional integral again vanishes 
due to symmetry, yielding a generalization of the formulas ( |2.52| ) and ( [2.53 ) of Section 2.4. 
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3. Rigorous Error Bounds. 

In this section, we describe how the densities in Theorem 1 were calculated and provide a 
rigorous analysis bounding the error between the calculated and true values. 

Suppose that we wish to evaluate #8;3,5,7- According to equation ( |2.51| ), we need only to 
evaluate 

P.V. //**"■'«■"> dfrfr-P.V fff(^.X-»)F12,-2(,X-,)F(-2,, ft )^^ 

J J £r] J J £rj /g j\ 

where we have used the formula Q2.44J ) for ps-,3,5,7- We shall approximate this integral by 
sampling the integrand on the (symmetrically offset) grid of points 



me ne 
~2~'T 



me 



ne 



< C, m,n odd 



for some appropriately small e > and some appropriately large C > 0. In fact the quantity 
we actually compute is 45 , 8 ; 3 > 5 )7 (e, C, T), where we define 

q ( nrr\- Y-v- FAme, X-&)F T {{n - m)e, X-i)F T (-ne, Xs) ,„ , 

\m\,\n\<2C/e " blL 

m,n odd 

here Ft(z,x) is the approximation to F(z,x) defined in equation ( |2.11| ), and as before xd 
is the character given by the Kronecker symbol Xoij 1 ) — (—)■ 

The quantity S$-3 t 5j(e,C,T) is a discrete, truncated approximation to the integral (|3.1| ) 
involving an approximated summand as well. The overall error incurred in evaluating (|3.1| ) by 
means of ( |3.2| ) thus consists of three components: error due to discretizing the integral, error 
due to truncating the resulting infinite sum, and error due to approximating the summand. 
In Sections 3.1-3.3, respectively, we obtain rigorous bounds for each of these sources of error, 
and in Section 3.4 we combine these bounds to establish Theorem 1. Section 3.5 provides 
some technical bounds that are required for our arguments in Section 3.1. While in the 
sections to follow, all of the specific expressions we write down (such as S$-3,5j(e,C,T)) 
are those that arise in the calculation of the single density ^853,5,7, the given constants and 
error bounds were chosen so as to apply also to the analogous quantities arising during the 
calculation of any of the densities listed in Theorem 1. 

3.1. Error Due To Discretization. The first step is to discretize the calculation of /8;3,5,7 
by converting the integral defining l8;3,5,7 into a sum; we may bound the error incurred 
by doing so using the Poisson summation formula, as we now explain. Let f(£,rj) be a 
continuous, integrable function on R 2 such that both / and / decay rapidly enough near 
infinity (for instance, exponential decay certainly suffices). Then / satisfies the Poisson 
summation formula 

k=—oo£=—oo k=-oo A=-oo \ 1 2 / 

(see for instance Stein and Weiss |b|, Corollary 2.6 of Chapter VII], although we are using a 
Fourier transform ( |2.6| ) with a different choice of constants). In this formula, set e% = e 2 = e 
and a = (5 = e/2, and make the change of variables m = 2k + 1 and n = 2£ + 1 on the 
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left-hand side, to obtain 

, 2 EE/(f ,f)=Mo) + EE /fe^) (-i)-». (3.3) 

m,neZ v z z 7 K ,Aez \ e e / 

m,n odd (re,A)/(0,0) 

Now let 

t(t x P8;3,5j(^V) - P8;3,5,7(C, 0)p 8 ;3,5,7(0, ^) 

which can be extended continuously over the coordinate axes as was noted in Section 2.3. 
This function / is integrable and has exponential decay near infinity by Lemmas |2.3| and p~4 , 
and its Fourier transform can be seen to equal 

f(u,v) = -An 2 (P( Ul v) -P^u^iv)), (3.4) 

where 

__ POO POO 

P(U, V)= / dps-3,5,7 



u Jv 



is the upper cumulative distribution function of P8;3,5,7 and Pi(u) = P(u, —00) and Pi(v) = 
P(— oo,t>) are the corresponding "upper marginals". (Note that f(u,v) is a dependence 
measure of the type mentioned at the end of Section 2.4.) At the end of this section we will 
show that the function / decays exponentially as well, so that we are justified in applying 
the form ( |3.3j ) of the Poisson summation formula to /. 
Now observe from equation (2.49) that 



Iw,sj = J J f(t,v)d£dri = f(0,0). 

Therefore applying equation ( |3.3| ) to the function /, we have 

t 2rv P 8 . 3)5)7 (me/2, ne/2) - p 8;3i5i7 (me/2, 0)p 8;3i 5 i7 (0, ne/2) 

h;3,5,7 = £ LL 7 77^7 — 77^ + Errori, 

tfniz (me/2) (ne/2) (35) 

m,n odd 

where Errori, the error due to discretization, is given by 

Error 1 = 4vr 2 EE (p( , ) - Pi( W ))(-ir +A - (3.6) 

K ,Aez e e e e 

O,A)^(0,0) 

Defining 
Q(u,v) = (p(27iu,2tiv) -T 1 {2txuJP 2 {2txv)\ + (p(-2im, 2irv) -T 5 1 {-2wu]P 2 {^v)) 
+ (p(2ttu, -2nv) - Pi(27ra)P 2 (-27u;)) + (P(-2nu, -2nv) -P x {-2txu)P 2 {-2ixv) 
and grouping the terms on the right-hand side of equation ( |3.6| ) analogously, we obtain 

A ' 

e 



Erro ri = ^(j2U-^ K+X Q '(-,-) + \ £(-W(",0) + ± £(-l) A Q(o, 

V K.Aez+ e e L nez+ e z Aez+ 



,Aez+ ^ ^ ksz+ ^ Aez 

so that 



|Error 1 |<4vr 2 EE |<?(-,-)|. ( 3 - 7 ) 



;,A>0 
(k,A)^(0,0) 



e e- 
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Now let (X, Y) denote a pair of real- valued random variables whose joint distribution is 
given by ps-,3,5,7 (these random variables are given explicitly in equation ( |3.33| ) below, though 
their explicit form is not needed here). Then P(u,v) = Pr(X > u, Y > v) and hence 
-Pi (it) = Pr(X > u) and P2(v) = Pr(Y > v). With this interpretation, and using the fact 
that P8;3,5,7 is symmetric about the origin, the identity 

Q(u, v) = Pr(X > 2nu, Y > 2irv) - Pr(X > 2vra, Y < -2nv) (3.8) 

is easily verified. Clearly 

< Pr(X > u, Y > v) < min{Pr(X > u), Pr(F > v)}. 

Moreover, since ps;3.5,7 is symmetric about the origin, each component X and Y is a sym- 
metric random variable, so that 

< Pr(X > u, Y < -v) < min{Pr(X > u), Pr(F < -v)} = min{Pr(X > u), Pr(F > v)}. 

It therefore follows from the identity ( p.8| ) that 

\Q{u,v)\ < min{Pr(X > 2tiu),Pt(Y > 2nv)}. (3.9) 

In Section 3.5 we shall establish the bounds 

Pr(X > u) < exp(-0.04(w - 3) 2 ) 

Pr(F > u) < exp(-0.04(w - 3) 2 ) 

for any u > 3. Hence by the inequality (|3.9|) , 

\Q(u, v) | < exp (-0.04(2tt max{u, v} - 3f 

if either u or v exceeds 1, so that by equation (|3.7|), 



(3.10) 



(Erroril < 4vr 2 [ ^ ^ exp (-0.04(— 3) 2 

V k=0 A=max{K,l} C 

OO OO r\ ^ \ 

+ E E exp (_0.04(— -3) ) (3.11) 

A=0 re=max{A,l} / 

^ 8vr 12 12 exp(-0.04( 3J 

k=0 A=max{K,l} 

if e < 1, say. Now for any positive integer A , 

J2, / /2ttA \2n / /27rAn \2 

12 ex P (~0.04( 3J J < 2 exp (-0.04( - 3J 

a=a e ' ' v e 

since each term of the sum is at most half of the preceding term. Applying this inequality 
twice in succession to the bound (|3.11| ) gives 

|Errori| < 87r 2 ( 2exp (-0.04(— -3) 2 ) + 2 ^ exp (-0.04(-^ - 3) 2 ) 

<48Tr 2 exp(-0.04( 3J 

We therefore conclude that 

|Errori| < 5 x 1(T 12 (3.12) 



BIASES IN THE SHANKS-RENYI PRIME NUMBERS RACE 25 

for any choice of e < 1/5, which is more than adequate for our purposes. 

To conclude this section, we return to the matter of showing that the function / given 
in equation (ft.4j) decays exponentially. In terms of the random variables X and Y, the 
formula 03.41) becomes 



f( u , v) = -4rr 2 (Pr(X > u, Y > v) - Pr(X > u)Pi(Y > v)). 
By an argument similar to the one used for the function Q, we see that 

\f{u,v)\ < 47r 2 max{Pr(X > u, Y > w),Pr(X > u)Pi(Y > v)} 
< 4tt 2 min{Pr(X > u), Pr(Y >v)}. 
On the other hand, by elementary considerations we have 

f(-u, -v) = 4rr 2 (Pr(X > -u, Y > -v) - Pr(X > -u)Pv(Y > -v)) 

= 4tt 2 ((1 - Pr(X < -u) - Pr(F < -v) + Pr(X < -u, Y < -v)) 

- (1 - Pr(X < -n))(l - Pr(F < -v))) 
= 4vr 2 (Pr(X < -u, Y < -v) - Pr(X < -u)Pt(Y < -v)). 
By the same argument as in equation ( fj. 13| ) we see that 

\f(-u,-v)\ < 4vr 2 min{Pr(X < -u),Pr(Y < -v)} 
= 47r 2 min{Pr(X > u),Pr(Y > v)} 

since X and Y are symmetric. We can therefore apply the bounds ( |3.10|) to conclude that 
|/(-u, -v)\ < 4tt 2 exp (-0.04(max{|u|, |u|} - 3) 2 ) 

if either \u\ or \v\ exceeds 3. In particular, the function / decays (faster than) exponentially, 
as claimed above. 

3.2. Error due to truncating the infinite sums. From equation (|3.5|) we have 



T A V-V- P8;3,5,7(W2,ne/ 2 ) -P8;3,5,7(W2,0)/S 8 ;3,5,7(0,W2) „ 

^8;3,5,7 — 4 /_,/_, + -t^rrori 

rr mn 

m,n6Z 
m,n odd 

= 4^ 8;3 ,5,7(e) + Error! (314) 

where we have defined 

o (A ^^ p 8; 3,5,7(^e/2,ne/2) 

m,n odd 

(The term that has been omitted in the latter equality in equation ( |3.14j ) equals zero, since 
P8;3,5,7(^ e /2, 0)p 8; 3 j5i 7(0, ne/2)(mn) _1 is odd in either variable separately due to the symme- 
try of the functions p 8;3]5i 7(me/2, 0) and p 8; 3,5j(0, ne/2) through the origin.) At this point we 
have accomplished the first step of converting our integral /8;3,5,7 m to a discrete sum, with a 
manageable error; the next step is to truncate the ranges of summation so that the resulting 
sum has only finitely many terms. 



(3.16) 
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From the formula (|2.44|) for P8 ; 3,5,7, the definition (|3.15|) becomes 

c r \ _ v- v- F (me, X-8)F((n - m)e, x~4,)F(-ne, Xs) 

»J8;3,5,7lej — 2^2^ 

~Z mn 

m,n odd 

= Sgj3,5,7(e, C) + Error 2 , 
where we have defined the truncated series 

S (e C) = V'V' F ( me > *- s "> F (( n ~ m ) e ' X-4)F(-ne, xs) ,g ^ 

\m\,\n\<2C/e mn 

(the primes indicating that the sums are taken over only odd values of m and n) and the 
error due to truncation 

Error = T'T' F ( me ' X-8)F( (n - m)e, X-i)F(-ne, Xs) 



max{|m|,|n|}>2C/e mU 



We rewrite this as 



{m/2 m 

E' E' + E' E' 
m>2C/e n=-m m>2C/t n=m/2 

^ F(me,xs)F((n-m)e,X-4)F(-ne,X8) 
mn 



+ y' v' + v' v' - (3 is) 

n>2C/e m=-n n>2C/e m=n/2 J 



where the factor of 2 comes from grouping together the terms corresponding to (m, n) and 
(— m, —n) by the symmetry of the summand through the origin. 

To bound Error 2 , we will certainly need explicit estimates for the functions F(x, x) on the 
real axis. We recall the upper bound Q2.16D , 



\F{x,x)\<{A4Y J/2 i{{\ + i*) 



1/4 



where J is any positive integer and < 71 < 72 < • • • are the imaginary parts of the 
nontrivial zeros of L(s, x)- Any particular choice of J gives an upper bound of the form 

\F(x,x)\<d( X )\xr {x) (3.19) 

for some positive constants d(x) an d e(x)- F° r an y fixed x the optimal choice of J is the 
largest integer such that (rrx) 2 > \ + 7j; but for our present purposes, we obtain sufficiently 
good results that are easy to apply uniformly in x by choosing J so that jj is just less 
than 30. Table |3.1| lists, for each of the five characters x relevant to the densities mod 8 and 
mod 12, the values of J chosen and the resulting values of d(x) and e(x) in the bound ( |3.19| ), 
which we computed from the lists of zeros of the L(s, x) supplied to us by R. Rumely. 
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X 


J 


dix) 


e(x) 


X-s 


56 


1.3 x 10 3 ^ 


28 


X8 


56 


2.1 x 10 3 ^ 


28 


X-4 


46 


8.5 x 10 2e 


23 


X-3 


42 


7.5 x 10^ 4 


21 


X\2 


62 


3.0 x 1(P 


31 



Table 3.1. Allowable constants in the bound ( p,19|) for \F(x, %)| 



Since |-F(x, x)| is also bounded by 1 on the real axis, we can estimate the first double sum 
in equation (3.18) by 



m/2 

m>2C / e n=-m 



F(me, x-n)F((n - m)e, x-i)F{-ne, xs) 



inn 



m/2 

< E E 

m>2C/e n=~oo 



F(me, x-8)F{{n - m)e, x~ 



m 



(3.20) 



m/2 



< d(x~8)d(x-4) e e(x_8) e(x_4) E E m ' ' "' "('"-". 



-e(x-s)-!/ 



-e(x-4) 



m>2C/e n= 



using the bound ( |3.19| ) for X-s and X-4- 
Now we claim that 



m/2 



E E ™~ a ( 



m — n 



,-P 



E E 



m a n ^ 



m>M n=— oo 



m>Af n=m/2 

< 2 l3 - 1 M 1 - a - (S 



+ 



M 



a + /3-l (a + /5-2)(/5-l) / 



(3.21) 



for any real numbers a,/5 > 1. The equality is clear upon making the change of variables 
n i— ► m — n, while the inequality follows from the elementary argument 



£ £ m-V < £ 



m 



m>M n=m/2 



m>M 



rm 
2~ 



J\m/2] , 



m>Af 



m 



l-/3> 



2 / ' /3 — 1 V 2 

< 2 P / t""-' 3 dt + / t l - a ~P dt 

Jm 3 — 1 Jm 



2 /3 



a + /3- 1 



/3 

M l-a-/3 + 



2 /3- 



(/3-l)(« + /3-2) 



M 2-a-/3 
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this last expression being equivalent to the right-hand side of (3.21). 

Applying the upper bound (3.21), with a = e(x-s) + 1 and j3 = e(x~i), to equation ( |3.20| ) 
gives 



m/2 



E E 

m>2C/e "=-m 



' F(me, X-8)F((n - m)e, X-4,)F(-ne, xs) 



inn 



< rf(x- 8 )rf(X-4)e~ e(x " 8Ke(x " 4) 2 e(x - 4)_1 ( — ) 

2C/e 



/2C\ ~ e ( x - 8 ) _e (x- 4 ) 



+ 



V e (X-s) + e(x_ 4 ) (e(x-s) + e(x-4) - l)(e(y_ 4 ) 
d(X-8)rf(x~4)2~ e(x - 8) C- e(x - 8) - e(x - 4) 



(3.22) 



x 



+ 



C/t 



e(x- 8 ) + e(x_ 4 ) (e(x-s) + e(x_ 4 ) - l)(e(x_ 4 ) - 1) 



Substituting the appropriate values from Table |3.1| , we find that this last expression is less 
than 1.85 x 1(T 7 when e = 1/20 and C = 15. 

The second double sum in equation (3.18) may be similarly bounded as 



E E 

m>2C/e n=-m/2 



/ F(me, x_ 8 )F((n - m)e, X-^)F(-ne, Xs) 



< E E 

m>2C/e n=m/2 



mn 
F(me,x-8)F(-ne,Xs) 



m 



< d(x-s)d(x8)e' eix - s) ~ e{xs) E E m-^-^n-^ . 

m>2C/e n=m/2 

Applying (|3.19|) to bound this last expression yields just the right-hand side of equation ( |3.22| ) 
except with x_ 4 replaced with x 8 ; upon substituting values from Table |3J] we find that this 
expression is also less than 1.85 x 10~ 7 when e = 1/20 and C = 15. The third and fourth 
double sums in (3.18) are treated the same way, and so we conclude from equation (3.18) 
that 



|Error 2 | < 8(1.85 x 10~ 7 ) < 1.5 x 10" 



(3.23) 



when e = 1/20 and C = 15. 



3.3. Error due to approximating F(z, x) by Ft(z, x)- We have accomplished the second 
step of approximating the infinite sum 5 , 8;3,5,7( e ) by the finite sum Sg^^e, C); however, this 
latter sum is still unsuitable for computation, since it involves the functions F(z, x) which 
are infinite products. The last step is to replace the functions F(z, x) by their truncated 
counterparts F T (z,x) defined in equation (|2.11|) . 
Recall the definition ( |2.12| ) of &i, 



b l = b 1 (T, X ) = -T,T 



7 >T 4 



+ i 2 
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and put 

M»,x)= n r^ ( y ) -i- (3.24) 

From the definitions (|2.9|) and ( |2.11| ) of F and Ft we see that 

F{z, X ) = F T (z, X )(l + &t{z, X ))- 
Making this substitution in equation (|3.17| ) for %_§, x_ 4 , and xs, we then obtain 

S 8 ;3,5, 7 (e, C) = S 8 ;3, 5J (e, C, T) + Error 3 , (3.25) 

where Ss-3 t 5j(e,C,T) is as defined in equation (|3.2| ) and 
Err r- V'V' F ^( m6 ' X-sj^HQ - rnje, X-j F T (-ne, x%) 

\m\,\n\<2C/e Vfln 

x ((H-A T (me,x-8))(H-A T ((n-m)e,x-4))(H-A r (-ne,X8))-l). (3.26) 

Regarding the size of the function At, Rubinstein and Sarnak [10[ Section 4.3] established 
the inequality 

blx A 



2(1 — |6i|a: 2 ) 



Y[ Jo^x) - (1 + bxx 2 ) < 

.■y>T J 

for real numbers x satisfying \bi\x 2 < 1. From the definition ( |3.24| ) of At this immediately 
yields 

\A T (x y)| = in 7 >T^oK^)-(l+M 2 )| bjx* 

1 Tl ' XJI |l + &ix 2 | < 2(1 - \h\xY ll] (327) 

The quantities b\ can be computed if we know all the zeros of L(s, x) up to height T, since 

bl = 2., 1,2 ~ 2^ 1,2 

0<7<T 4 ' ' 7>0 4 ~ ' 

and we have the formula (see Davenport |3|, p. 83]) 

7 >0 4+^ /! 74+^" :7r/: ^ M X »X) (3.28) 

for a real primitive character x mod q, where 70 = 0.577215 ... is Euler's constant. The 
values L(l, x) can be calculated in closed form by classical formulas (again see M), while the 
values L'(l,x) can be calculated in closed form using a formula of Selberg and Chowla [I2"| for 
the odd characters and a formula of Deninger M for the even characters. The former formula 
expresses L'(l,x) in terms of the logarithm of the T-function, while the latter expresses 
L'(l,x) in terms of a function R(x) defined as 

here ((s,x) is the Hurwitz zeta function, defined when x > — 1 by ((s,x) = J2^=i( n + x)~ s 
for Re s > 1 and by meromorphic continuation to the complex s-plane. 

The Mathematica software package is capable of calculating both logT(x) and R(x) to 
arbitrary precision, and thus by the formula ( |3.28j ) the sums D 7 >o V(i + 7 2 ) can a l so t> e so 



30 



ANDREY FEUERVERGER AND GREG MARTIN 



UNIVERSITY OF TORONTO 



X 


L(1,X) 


#(i,x) 


v- 1 


Z^ 7 >U I/4+7 2 


X-s 


TV 

2\f2 


2^V70 + log ^7T + log r(1/8)r(3/8) ) 


0.158037 


X8 


log(l+\/2) 


^( 7o + log2vr + R(l) - i?(§) - i?(§) + i?(§)) 


0.117716 


X-4 


7T 
4 


f( 7 o + log2vr + 21og^@) 


0.077784 


X-s 


7T 

3a7S 


^( 7o + log27r + 31o g ™) 


0.056615 


Xl2 


log(2+V3) 
v^3 


^(70 + log2vr + H(i) - fl(A) - fl(£) + fl(g)) 


0.165083 



Table 3.2. Values of L(l, X ), L'(l, X ), and £ 7>0 jji^p 



calculated. Table 3.2 contains the results of such calculations for the five characters relevant 



to the densities mod 8 and mod 12. 

For all five of these characters, when we choose T = 10,000 we find that |&i| < 0.000173. 
The upper bound (|3.27|) can then be written more simply as \At(x, x)\ < D(x) for \x\ < 74, 
where we have defined 



D(x) 



1.5 x 10~ 8 x 4 



(1 - 0.00018x 2 ) 2 

Consequently, the definition (3.26) of Errors implies the inequality 

F T (me, x_ 8 )F T ((n - m)e, X-4)F T (-ne, Xs) 



(3.29) 



ran 



| Err or 3 1 < J2 Z 

\m\,\n\<2C/e 



The quantity on the right-hand side of this inequality was computed at the same time as the 
sum S8-3 57(6, C, T) was computed, and we obtained the bound 



x ((l + D(me))(l + D({n-m)e))(l+D(-ne))-i). (3.30) 



|Error 3 | < 5.5 x 10 . 



(3.31) 



3.4. Conclusion. From the relationships (|3.15|) , ( |3.16|) , and (|3.25|) among the various in- 
termediate sums <S8;3,5,7> we have 

h;3,5j = 45 , 8; 3,5j(e, C, T) + Errori + 4Error 2 + 4Error 3 . 

Using this identity in equation (|2.51|) yields 



1 



1 



1:3,5,7 



4 4tt 2 
whence it follows that 



'-8;3,5,7 



4 4rr 2 



(45 , 8;3,5,7( e ; C, T) + Error ! + 4Error 2 + 4Error 



3), 



8:3,5,7 



1 ff 8;3 , 5 ,7(e,c,r) 

4 



7T Z 



I Errori I I Error? I + |Error 3 | 



4rr 2 



7T 



Thus by the inequalities (|3.12| ), (|3.23|) , and (|3.31| ), we conclude that 

1 S 8;3>5>7 (e,C,Ty 



8* 



8;3,5,7 



4 



7T 



<8x 10 



-7 
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when e = 1/20, C = 15, and T = 10,000. Using these values for e, C, and T, the sum 
"S , 8;3,5,7(e, C, T) was calculated and found to equal 0.5645285 . . . , and therefore we have rig- 
orously that 

5 8 ;3,5,7 = 0.1928013 ±9 x 10~ 7 , 

which is slightly stronger than the first assertion of Theorem 1. 

The error analysis in Sections 3.1-3.3 can be repeated for each of the densities in The- 
orem 1; the constants mentioned in the error analysis have been chosen to apply to all of 
these densities. Therefore, the densities calculated for Theorem 1 are all correct to within 
the same margin 9 x 10 -7 , which is enough to establish the theorem. 

3.5. Appendix: Probability bounds. In this section we establish the bounds ( P-lOj ) for 
Pr(X > u) and Pr(Y > u) which were used for the computations in Section 3.1. 

To do so, we first recall from Section 2.1 the explicit form of the random variables having 
the distribution fi q;ai ,...,a r - Specializing the representation ( |2.19| ) to the case q = 8 and 
{oi, O2, a 3 } = {3, 5, 7}, we find that 1^8,3,5,7 is the distribution of the random R 3 -vector 

(1, 1, 1) + X(x- 8 )(l, -1, -1) + X( X _ 4 )(-1, 1, -1) + X( X s)(-l, -1, 1). 



Next, recalling the changes of variables ( |2.41|) and (|2.43| ) that took us from /i to v and then 



to p, we observe that ps;3,5,7 is the distribution of of the random R 2 -vector 

X(x-8)(2,0) + X(x-4)(-2,2)+X( X 8)(0,-2). (3.32) 

Now define the two real-valued random variables 



X = 2 J^ a 7 sin(27rf/ 7 ) — 2 J^ a 7 sin(27r£/ 7 ), 

7>0 7>0 

L(l/2+i 7 , X -8)=0 L(l/2+i 7 , X -4)=0 

Y = 2 J2 a i sin(27rf/ 7 ) - 2 ^ a 7 sin(27rt/ 7 ). 

7>0 7>0 

L(l/2+i7, X - 4 )=0 L(l/2+i 7lX 8)=0 



(3.33) 



We see from the definition (|2.18j ) of the X(x) that the random vector (X, Y) equals the 
random vector ( |3.32j ). 

The following lemma gives information about the tails of random variables of this type. 

Lemma 3.1. Letri > r% > • • ■ be a sequence of positive real numbers such thatY^kLi ^k — 00 
but SfeLi rl = R < 00. Let U\, U-i, . . . be independent random variables uniformly distributed 
on [0, 1], and define the random variable 

00 
W = J2r k sm(2nU k ). 

k=l 

Then for any real number w > 2v\, 

-3(w-2n) 2 



Pr(jy > w) < exp 



1QR 
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Proof. Theorem 1 of Montgomery J8], Section 3] states that under the assumptions of this 

K \ / o / K \ 2 



lemma, we have 



Pr(w>2j2r k )<exp(-l(Y,r k ) / £ r*l (3.34) 

\ fe=l / \ 4 \fc=l / fc>_ftT / 



for any integer K > 1. Since the r^ are decreasing and SfcLi r fc — oo, it is clear that for any 
u; > 2?"! there exists ai^>l such that 



K 

w -^ w 

- - n < ^ r fe < -. 

z k=i z 



With this choice of K, the inequality (|3.34|) simplifies to 

—3(w-2r 1 f 




Pr(W > w) < exp — -(— — ri) / >J r k ) — ex P 



which is the statement of the lemma. □ 



We now apply this lemma to the random variables X and Y defined in equation (|3.33| ). 



(Note that because each variable U 1 is uniformly distributed on [0, 1], we may replace each U 1 
in the second sums on each line with U y + 1/2; this has the effect of changing the subtraction 
signs in the equations ( |3.33| ) to addition signs, thus rendering X and Y into the form to 
which Lemma 3.1 applies.) For the variable X, the sequence corresponding to rj, is 

{2a 7 : L(l/2 + 27, X-s) = 0, 7 > 0} U {2a 7 : L(l/2 + i 7 , X - 4 ) = 0, 7 > 0}. 

For this sequence, the largest element n is less than 1.5, and the sum i? of the squares of 
the elements does not exceed 4.5. Therefore, applying Lemma [34], we find that 

Pr(X > u) < exp(-0.04(w - 3) 2 ) 

for any u > 3. Similarly, Y can be shown to satisfy the same estimate, which establishes 
the upper bounds ( p.lO|) . In fact, the constants mentioned above will work for every pair of 
characters that arises in the computations of P8;a u a 2 ,a 3 , where {01,02,03} is a permutation 
of {3, 5, 7}, and in pi2 ]ai ,a2,a 3 , where {a 1; a 2 , a 3 } is a permutation of {5, 7, 11}. 

4. Computational Results. 

The mathematical and numerical computations described in this paper were implemented 
on an SGI Challenge computer using the Mathematica software package, which has the 
capability to perform computations to arbitrary and verifiable precision (see Wolfram ||15|| ). 
A typical quantity to be calculated is the expression ^^^^(e, C, T) defined in equation (|3.2|) , 
which depends on the functions F T (z,x) defined in equation (|2.11|) . To compute these 
functions we needed, for the Dirichlet L-functions corresponding to characters to the moduli 
q < 12, lists of the zeros whose imaginary parts are bounded by T = 10,000. These lists 
of imaginary parts of zeros (accurate to twelve decimal places) were kindly supplied to us 
by R. Rumely (see [0j). For the estimation of Error3 in Section 3.3 it was also necessary 



to compute quantities typified by the right-hand side of equation (3.30), which is no harder 
than computing S 8 . 3 ^j(e,C,T) itself. 

In addition to the results reported in Theorem 1, a number of further computations were 
carried out involving certain cases with q < 12 and r < 4. In these additional results, which 
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are presented below, we report only the numbers of decimal places in which we have some 
degree of confidence; specifically, we expect the entries to be correct to within one or two 
units in the last decimal place reported. 

Table p~T| shows the calculated densities S qmm for the two-way races between ir(x; q, Oi) 
and 7r(x;q, a 2 ), for the moduli q = 3, 4, and 5. For example, the first line of the table 
indicates that 5 3;2i i = 0.999063 (rounded to seven decimal places). Throughout this section 
we use the symbol iV to stand for any nonsquare mod q and S to stand for any square mod 
q (although distinct occurrences of iV or 5* in a single entry stand for distinct residues) to 
make the Chebyshev biases more clearly evident where appropriate. 

Of course, since (p(3) = (p(4) = 2, the two-way races shown are the only possible races 
for the moduli 3 and 4. The densities for these moduli were calculated by Rubinstein and 
Sarnak and our calculations agree with theirs to six decimal places. (Although they were 
only reported in Jl0| truncated to four decimal places, they had in fact been calculated to 
higher accuracy.) 



For the races modulo 5, it turns out that the densities 6, 



q;ai,a2 



depend only on whether or 
not ai and a 2 are squares mod 5, due to the symmetry results given in Theorem 2. (In fact 
this is true for the races between multiple residues mod 5 as well.) For instance, applying 
Theorem 2(b) with a\ = 2, a 2 = 1, and 6 = 4 shows that S^ 2t i = ^5;3,4; then applying 
Theorem 2(a) to each of these expressions shows further that ^5 ; 2,i = ^5;3,i and 65-3^ = 65-2,4. 
Since the two nonsquares mod 5 are {2,3} while the two squares are {1,4}, these equalities 
show that all four densities represented by ^a^s are equal, as indicated in Table JO]. 

The fact that S q -N t N — $q;S,s — 1/2 as shown on the penultimate line of the table was 
proved by Rubinstein and Sarnak, and it also follows from our Theorem 2(d). We calculated 
these densities anyway, and the calculated answers differed from 1/2 by at most 10~ 16 , which 
is the default machine precision for our Mathematica calculations. This degree of accuracy 



is not unexpected in this instance, as the integral in the formula ( |2.57| ) is identically zero 
when a\ and a 2 are both squares or both nonsquares mod q. 

Table |T2] provides the calculated densities S q;aua27 a 3 for the three-way races modulo 5. 
Again, in this case the densities only depend on whether ai, a 2 , and a 3 are squares mod 5, 
by the symmetry results (a) and (b) of Theorem 2. In addition, each density matches two 
different types of permutations: for instance, Theorem 2(e) with g^ = 2, a 2 = 3, a 3 = 1, and 



2 asserts that 5 



5;2,3,1 



<^5;2,i,4 as indicated in the first entry of the table. 



Q 


a\a 2 


Uq;ai,a,2 


3 


NS: 21 
SN: 12 


.9990633 
.0009367 


4 


NS: 31 
SN: 13 


.9959280 
.0040720 


5 


NS: 21,24,31,34 


.952140 


NN: 23,32 
SS: 14,41 


1/2 


SN: 12,13,42,43 


.047860 



TABLE 4.1. Two-way races for the moduli q = 3, 4, 5 
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axa 2 a 3 


"5;ai,a2,03 


NNS: 231,234,321,324 
NSS: 214,241,314,341 


.45678 


NSN: 213,243,312,342 
SNS: 124,134,421,431 


.03859 


SNN: 123,132,423,432 
SSN: 142,143,412,413 


.00464 



Table 4.2. Three-way races modulo q = 5 



As mentioned at the beginning of this section, we are confident from numerical consider- 



ations that the numbers reported in Table |4^ are accurate to the five decimal places given 
there, with a possible error of one or two units in the fifth decimal place. Thus, for instance, 
if we choose a particular triple of residues such as {1,2,3} and add up the densities from 
Table [|7| corresponding to the six permutations of that triple, the result is 1.00002. More- 
over, the three ordered triples {3, 2, 1}, {2, 3, 1}, and {2, 1, 3} are the three permutations in 
which 2 is ahead of 1, and so we have the identity 



J 5;2,l 



"5;3,2,1 + ^5;2,3,1 + ^5;2,1,3 



(4.1) 



(cf. equation (5.1)). Table O gives .952140 for the left-hand side of this identity, while 



adding the appropriate entries from Table [4.2| gives .95215 for the right-hand side. 

There are two reasons why our calculations of the densities in three-way races for moduli 
other than 8 and 12 are less accurate than the full six-decimal-place accuracy proven in 
Theorem 1, both stemming from the fact that there are complex- valued Dirichlet characters 
associated with the other moduli. First, when we calculate the function F T (z,x) we do so 
only on a discrete set of points, evenly spaced at intervals of e/2. These points are the only 
ones needed to evaluate the sum Ss-3,5j(e,C,T), as we see from its definition (|3.2|), but for 
the sums corresponding to other moduli we need to know the value of Fr(z, x) at irrational 
multiples of e. We estimated this value by interpolating linearly between the two nearest 
values, and this estimation introduces an additional error into the calculations. 

Second, the zeros of L-functions corresponding to complex characters are not symmetric 
with respect to the real axis, and so the quantity S 7>0 1/(| + 7 2 ), needed to compute 
bi(T, x), cannot be evaluated in closed form. Since we can evaluate &i(T, x) + bi(T,x) in 
closed form, we used half of this quantity in place of both bi(T, x) and &i(T, x); this gives 
the correct first-order approximation to the tail of F(z, x)F(z, X), but the absolute error in 
our calculations can be somewhat higher as a result. For higher moduli, the sheer number 
of characters will also play a role, as the product of the 0(g) — 1 functions Ft(z, x) required 
for the evaluation of p q - ai ,...,a r will gradually erode the accuracy of the calculated number. 

Since there are precisely four reduced residues modulo 5, it is natural to look at the 
complete four-way race mod 5; Table |4.3j shows the calculated densities for this four-way 
race. Here again, the densities only depend on whether oi, a 2 , a 3 , and 04 are squares mod 5, 
by the symmetry results from parts (a) and (b) of Theorem 2, with the added symmetry in 
the third entry of the table following from Theorem 2(e). Once again we can estimate the 
accuracy of these densities by comparing the sum of all twenty-four densities to 1, and also 



BIASES IN THE SHANKS-RENYI PRIME NUMBERS RACE 



:S5 



d\d20,^0ji 


"8;ai,a2,a3 


NNSS 


2314,2341,3214,3241 


.21136 


NSNS 


2134,2431,3124,3421 


.02985 


NSSN 
SNNS 


2143,2413,3142,3412 
1234,1324,4231,4321 


.00424 


SNSN 


1243,1342,4213,4312 


.00028 


SSNN 


1423,1432,4123,4132 


.00007 



Table 4.3. The full four- way race modulo q = 5 



by comparing the values here to those in Table [O using identities such as 



d. 



5;1,2,3 



-'5:4,1,2,3 



+ 5 ; 



5:1,4,2,3 



+ s : 



5;1,2,4,3 



+ 4 



5;1,2,3,4- 



In all cases, these sums of densities from Table [4.3| are precise to within a few units in the 
fifth decimal place. 

In the calculation of these four-way densities, the general formula given in Theorem 4 
involves a three-dimensional integral which must be computed numerically. Performing this 
calculation with a reasonable degree of accuracy lies at the limit of the computing capabilities 
of the method used for the calculations in this paper; in particular, we found it necessary 
to reduce the value of C and increase the value of e somewhat to make the computations 
feasible. 

Since the distribution of the primes into residue classes modulo 6 is fully determined 
by their distribution mod 3, the next modulus of interest is q = 7. Table |4.4| shows the 
calculated densities S 7]aiA2 for the two-way races modulo 7. Here for the first time, we see 
that the density does not depend merely on whether a\ and a 2 are squares mod 7: the 
squares mod 7 are {1,2,4}, and so each of the top two lines of the table are densities of 
the form Sr-N,s, while each of the bottom two lines are densities of the form Sr-s,N- in other 
words, Chebyshev's bias is not the only factor causing asymmetries in the Shanks-Renyi 
race games. (For a somewhat more precise discussion of Chebyshev biases for r-tuples with 
r > 3, see the discussion of "bias factors" in Section 6.) The middle row of the table again 
indicates the known fact that all densities of the form ^at.jv an d S?.s t s equal 1/2. 

Table ^4.5| gives the calculated densities for the three-way races modulo 7. Because the 
number of different values for the densities is larger than in the previous cases, we have 
not organized them strictly by decreasing size, but rather we have grouped together the 



a 1 a 2 


"7;ai,<Z2 


31,32,51,54,62,64 


.874349 


34,52,61 


.845210 


12,14,21,24,41,42, 
35,36,53,56,63,65 


1/2 


16,25,43 


.154790 


13,15,23,26,45,46 


.125651 



Table 4.4. Two-way races modulo q = 7 
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aia 2 a 3 



s> 



7;ai,a2,Q3 



512 
521 
251 
152 
215 
125 



, 314; 631; 651 


621 


341; 361; 561 


612 


431; 316; 516 


162 


134; 613; 615 


261 


413; 136; 156 


126 


, 143; 163; 165 


216 



324 
342 
432 
234 
423 
243 



532; 
352; 
325; 
523; 
235; 
253; 



562 
652 
625 
526 
265 
256 



641; 542; 
614; 524; 
164; 254; 
461; 452; 
146; 245; 
416; 425; 



354 
534 
543 
345 
453 
435 



364 
634 
643 
346 
463 
436 



.4038 
.3678 
.1027 
.0736 
.0295 
.0226 



312,321; 351,531; 514,541 
132,231; 315,513; 154,451 
123,213; 135,153; 145,415 



362,632; 624,642; 564,654 
326,623; 264,462; 546,645 
236,263; 246,426; 456,465 



.3943 
.0857 
.0200 



124,142,214,241,412,421 



356 



365,536,563,635,653 



1/6 



Table 4.5. Three-way races modulo q = 7 



values corresponding to isomorphic race games. We say that two r-tuples {a\, . . . ,a r } and 
{&i, . . . ,b r } of reduced residue classes mod q have isomorphic race games if there exists a 
bijection r from the set {1, . . . ,n} to itself such that each residue aj acts exactly like the 
corresponding residue b T (j), i.e., if 



S, 



<?; a a(l). 



l a(r) 



s, 



q;b T 



("(!))> 



°r(<T(r)) 



for every permutation a of {1, . . . , n}. 

For instance, Theorem 2(a) tells us that ^7 ; i,2,5 = ^7;i,3,4 an d similarly for the corresponding 
permutations of {1,2,5} and {1,3,4}. Therefore the bijection r: {1,2,5} — * {1,3,4} given 
by r(a) = a~ l (mod 7) shows that these triples have isomorphic race games. Table 4~5 



shows that there are ten triples whose race games are in the isomorphism class determined 
by {1, 2, 5}; the six densities for the race games in this class are all distinct. In addition, there 
are five triples in the isomorphism class of {1,2,3}; the race games in this class have only 
three distinct densities due to an additional symmetry generated by Theorem 2(a). Finally, 
the two special triples {S,S,S} = {1,2,4} and {N,N,N} = {3,5,6} each give completely 
symmetric race games; this is the smallest modulus to which parts (d) and (e) of Theorem 
2 can be applied, since three distinct squares or nonsquares are needed. The complete 
symmetry for these two race games was also proven by Rubinstein and Sarnak. We remark 
that our computations of these densities yielded 1/6 to five decimal places. We did not 
proceed further with computations modulo 7, since there is no natural four-way race and 
races with five or more residues are beyond the present capabilities of our computing set-up. 



Table |4T6| shows the calculated densities for the two-way races modulo 8. Because only one 
fourth of the residues mod 8 are squares (i.e., c(8, 1) = 3), in contrast to the lower moduli, 
there are fewer symmetries among the densities. (This is somewhat counterintuitive, since the 
multiplicative group modulo 8 is highly symmetric.) This higher value of c(8, 1) also causes 
a larger bias towards nonsquares, as can be seen by the fact that the values in Table [4.6| are 
more extreme than those in Tables 14.11 and 14.41. 



Table |4.7| shows the calculated densities for the three-way races modulo 8, including the 
values for S 8] n,n,n highlighted in Theorem 1. Since all of the characters mod 8 are real, the 
additional sources of computational error mentioned in the discussion of Table fO] are not 
present here, and so we feel justified in reporting these figures to seven decimal places; in 
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37 



aia 2 


"8;ai,a2 


31 
13 


.9995688 
.0004312 


51 
15 


.9973946 
.0026054 


71 
17 


.9989378 
.0010622 


35,37,53, 

57,73,75 


1/2 



Table 4.6. Two-way races modulo q = 8 



CLICL201S 


^8;ai, 02,03 


CL1CL2O13 


"8;ai,a2,a3 


01Q2O3 


"8;ai, 02,03 


CLlOi20'3 


"8;ai ,02,03 


531 
351 
315 
513 
135 
153 


.4996015 
.4974123 
.0025550 
.0003808 
.0000327 
.0000177 


731 
371 
317 
713 
137 
173 


.4995765 
.4989440 
.0010483 
.0004173 
.0000077 
.0000062 


571 
751 
715 
517 
175 
157 


.4990135 
.4974474 
.0024769 
.0009337 
.0000757 
.0000528 


357,753 


.1928013 


375,573 


.1664263 


735,537 


.1407724 



Table 4.7. The four three-way races modulo q = 8 



fact note that the sums of the appropriate three-way densities sum to the two-way densities 
in Table |51] in a manner analogous to equation ( fl.lp , with the sums all agreeing to within 
one or two units in the seventh decimal place. 

As with the modulus 5, it is natural to look at the complete four-way race modulo 8; 
Table |4.8j shows the calculated densities for this four-way race, listed in the lexicographical 
ordering on the permutations of {1,3,5,7}. Despite the need to use slightly cruder values 
of C and e in the calculations of the three-dimensional integrals arising in the formulas for 
these densities, the sum of all 24 densities and numerical checks against Table |4.7| suggest 
that these densities are also accurate to within one or two units in the seventh decimal place. 



Tables [4.9| and |4.10| show the calculated densities for the two-way and three-way races 
modulo 9. Since the multiplicative group mod 9 is isomorphic to the multiplicative group 



dl(l2Cb3Cb4. 


"8;ai, 02,03, 04 


Cli(32fl3Cl4 


"8;oi ,02,03,04 


d\(l2Cis(l4. 


"8;oi ,02,03,04 


CLlQ,2Q'3Q'4 : 


"8;ai, 02,03, 04 


1357 


.0000014 


3157 


.0000500 


5137 


.0000027 


7135 


.0000261 


1375 


.0000029 


3175 


.0000696 


5173 


.0000023 


7153 


.0000154 


1537 


.0000007 


3517 


.0007972 


5317 


.0001315 


7315 


.0008983 


1573 


.0000006 


3571 


.1919526 


5371 


.1406374 


7351 


.1398456 


1735 


.0000023 


3715 


.0015371 


5713 


.0000848 


7513 


.0002910 


1753 


.0000009 


3751 


.1648170 


5731 


.1663386 


7531 


.1924939 



Table 4.8. The full four- way race modulo q = 8 
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Gna 2 


"9;ai,a2 


21,24,51,57,84,87 


.881584 


27,54,81 


.864230 


14,17,41,47,71,74, 

25,28,52,58,82,85 


1/2 


72,45,18 


.135770 


12,15,42,48,75,78 


.118416 



Table 4.9. Two-way races modulo q = 9 



aia 2 a 3 



& 



9;ai,a2,a3 



514 


217; 821 


851 


841 


541 


271; 281 


581 


814 


451 


721; 218 


518 


184 


154 


127; 812 


815 


481 


415 


712; 128 


158 


148 


145 


172; 182 


185 


418 



247 
274 
724 
427 
742 
472 



524 
254 
245 
542 
425 
452 



584 
. 854 
; 845 
; 548 
; 485 
;458 



871; 574 


257 


817; 547 


527 


187; 457 


572 


781; 754 


275 


178; 475 


752 


718; 745 


725 



287 
827 
872 
278 
782 
728 



.4010 
.3814 
.0992 
.0819 
.0194 
.0172 



214,241; 517,571; 251,521 
124,421; 157,751; 215,512 
142,412; 175,715; 125,152 



284,824; 847,874; 587,857 
248,842; 487,784; 578,875 
428,482; 478,748; 758,785 



.3965 
.0885 
.0149 



147,174,417,471,714,741 



258,285,528,582,825,852 



1/6 



Table 4.10. Three-way races modulo q = 9 



mod 7 (both are cyclic of order 6), the various symmetries present in Tables 
mirror those found in Tables 



andElO 



and [151 , with the squares mod 9 being {1, 4, 7}. 
Again, the distribution of the primes into residue classes modulo 10 is determined by 
their distribution mod 5, so the next modulus of interest is q = 11. Table |4.11| shows the 
calculated densities for the two-way races modulo 11, where we have used the symbol T to 
represent the residue 10 mod 11. In the middle row, the entry "NN,SS" refers to the forty 
pairs {01,02} where ai and a 2 are either both among the nonsquares {2,6, 7,8, T} or both 
among the squares {1, 3, 4, 5, 9} mod 11. 



a\a 2 


"ll;ai,a,2 


23,25,64,69,71, 75,81,89,T3,T4 


.761121 


21,24,61,63,73,79,84,85,T5,T9 


.731135 


29,65,74,83,T1 


.713943 


NN,SS 


1/2 


1T,38,47,56,92 


.286057 


12,16,36,37,42,48,58,5T,97,9T 


.268865 


17,18,32,3T,46,4T,52,57,96,98 


.238879 



Table 4.11. Two-way races modulo q = 11 
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:W 



We do not include the calculations of the three-way races mod 11 for reasons of space. Us- 
ing Theorem 2 it can be checked that of the 120 distinct (unordered) triples of residues mod 

11, the twenty triples of the form {ab~ l , a, ab} with b a square mod 11 comprise two isomor- 
phism classes of race games of ten triples each; a race game in either of these isomorphism 
classes has only two distinct densities, one taken by four permutations of the triple and the 
other taken by the other two permutations. The forty triples of the form {ab' 1 , a, ab} with 
b a nonsquare mod 11 form four isomorphism classes with ten triples in each class; a race 
game in one of these classes has three distinct densities, each taken by a pair of permutations 
with the same middle element. Finally, the remaining sixty triples form three isomorphism 
classes of twenty race games each; a race game in one of these classes has all six densities 
distinct. There are 34 densities that remain to be calculated after these symmetries from 
Theorem 2 are taken into account, and the calculations reveal that these 34 densities are 
indeed distinct. 

As mentioned previously, determining the densities in a five- way race game lies beyond the 
scope of the computing methods used for the calculations in this paper (though this barrier is 
only technological, as Theorem 4 is valid for arbitrarily large race games). If this barrier were 
overcome (for example, by recoding in a lower level computing language) the five-way race 
among the squares mod 11 and the five- way race among the nonsquares mod 11 would be 
natural and interesting questions to consider, especially in light of the nearly-cyclic behavior 
of the leaders in these five- way race games reported by Bays and Hudson 0. Because of the 
symmetries of Theorem 2, it turns out that only eight distinct densities would need to be 
calculated for both of these five-way race games to be completely determined. 

Tables [4.12| , |4.13| , and [4.14J show the two-way, three-way, and four-way race games modulo 

12, respectively, using the symbol E to represent the residue 11 mod 12. Since the multi- 
plicative group mod 12 is isomorphic to the multiplicative group mod 8 (both groups being 



isomorphic to the Klein group of order 4) , the various symmetries present in Tables |4.12| , [4.13 , 
and [4.14| mirror those found in Tables [4.6| , [4.7| , and [4.8| . As with the modulus 8 case, all the 



characters mod 12 are real- valued, and so we feel justified in reporting seven decimal places 
of the numbers in these tables. 



Notice from Table 4.13 that the densities <5i 2 -5 n i and 5i 2 -7 n i only differ by one unit in 



the sixth decimal place, and that there are several other entries that differ by similarly small 
amounts owing to their small size. Nevertheless, we see no reason to believe that any of 



the twenty-one densities in Table 4.13 is equal to any another. Similar remarks hold for the 



a\a 2 


Ol2;a 1 ,a 2 


51 
15 


.9992059 
.0007941 


71 
17 


.9986061 
.0013939 


El 
IE 


.9999766 
.0000234 


57,5E,75, 

7E,E5,E7 


1/2 



TABLE 4.12. Two-way races modulo q = 12 
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O1O2O3 


"12;ai,a2,a3 


a!a 2 a 3 


0l2;ai,a2,a3 


040.20,3 


0l2;a\, 02,03 


0^0203 


0l2;ai,c22,<23 


751 
571 
517 
715 
157 
175 


.4992728 
.4986582 
.0012750 
.0006751 
.0000668 
.0000521 


5E1 
E51 
E15 
51E 
1E5 
15E 


.4999772 
.4992062 
.0007931 
.0000225 
.0000006 
.0000003 


7E1 
E71 
E17 
71E 
1E7 
17E 


.4999780 
.4986066 
.0013919 
.0000214 
.0000015 
.0000006 


57E,E75 


.1984521 


E57,75E 


.1799849 


5E7,7E5 


.1215630 



Table 4.13. The four three-way races modulo g = 12 



0^020304 


^12;<2i ,<22,<23,a4 


0^020304 


Ol2;a\, 02,03, 04 


0^020304 


"12;ai, 02,03, 04 


O4O2O3O4 


"12;ai, 02,03, 04 


157E 


< 10~ Y 


517E 


.0000004 


715E 


.0000001 


E157 


.0000664 


15E7 


.0000001 


51E7 


.0000010 


71E5 


.0000002 


El 75 


.0000519 


175E 


< 10- 7 


571E 


.0000152 


75 IE 


.0000059 


E517 


.0011332 


17E5 


< 10- 7 


57E1 


.1984364 


75E1 


.1799788 


E571 


.1787850 


1E57 


.0000002 


5E17 


.0001403 


7E15 


.0000243 


E715 


.0006505 


1E75 


.0000001 


5E71 


.1214216 


7E51 


.1215384 


E751 


.1977496 



Table 4.14. The full four-way race modulo q — 12 



twenty-four densities in Table |4.14| and for the corresponding Tables |4.7| and |4.8| for the race 
games modulo 8. One observation supporting our view is that whenever the symmetries of 
Theorem 2 imply that two densities are equal, the computed densities agree to within a few 
multiples of the default machine precision rather than to only five or six decimal places. 



5. Proofs of Theorems 2 and 3. 

In this section we establish Theorem 2, concerning symmetries of the densities <5 0;aii ... iar 
under certain permutations of the residue classes {01, . . . , a r }, and Theorem 3, giving some 
strict inequalities in the same setting. We first present the proof of Theorem 3 since it is 
somewhat simpler than that of Theorem 2. 

Proof of Theorem 3. Let a l5 a 2 , and 03 be distinct reduced residue classes mod q. We 
begin with the simple observation that if x is a real number such that 7r(x; q, a{) > tt(x; q, a 2 ), 
then the quantity 7r(a;; g, a 3 ) must either equal one of 7r(x; q, Oi) and n(x; q, a 2 ), lie between 
them, exceed both, or be exceeded by both. This observation leads to the density identity 

"g;ai,02 "q;03,ai,02 ' "g;oi,03,02 ' "g;ai,02,03; l^'-U 

since the set of real numbers x such that n(x; q, 03) = 7r(x; q, Oi) or tt(x; q, a 3 ) = 7r(x; q, a 2 ) 
has density zero, as mentioned in Section 2.1. It follows that 

"g;ai, 02,03 "g;03,02,ai "g;oi,a2 "5:03,02 5 \p-^l 

by using the appropriate identity of the type ( |5.1D on both terms on the right-hand side of 
equation ( |5.2j ) and simplifying. 

Now we can use our knowledge of the two-way densities to study the difference on the 
left-hand side of (J5.2|). In particular, if c(g, oi) = c(q,a 2 ) then S q . aita2 = 1/2, and hence 
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^501,02,03 — £g;a 3 ,a2,ai = 1/2 — 5 g;a3ia2 , an expression whose sign is known from the work of 
Rubinstein and Sarnak. More specifically, if N and N' are nonsquares mod q while S is a 
square mod g, then 5 q;NtN > tS ~ S q; s,N',N = 1/2 - 5 q;S ,N> > 0; therefore S^n^s > S q;S ,N',N, 
which establishes part (a) of the theorem. Similarly, if N is a nonsquare mod q while S and 
S' are squares mod q, then 5 q -s>,s,N < $q;N,s,S'i which establishes part (b) of the theorem. 

Another application is to the difference S q .N,s,N' ~ $q;N',s,N when N and N' are nonsquares 
mod q while S is a square mod q. In this case equation Q5.2| ) becomes 

5q;N,S,N' ~ 8q;N',S,N = ? ;JV,S ~ 8q;N',S, 

which immediately implies part (c) of the theorem. The analogous observation about the 
difference 5 q; s,N,s' — $q;S',N,s when S and S' are squares mod g while iV is a nonsquare mod 
q establishes part (d) of the theorem. □ 



We remark that the identity (|5.2| ), applied when oi, 02, and 03 are all nonsquares mod q, 
becomes S q;ai>a2:aa — S q;a3j(l2)ai = 0; this is another way of seeing that the densities calculated 
in Theorem 1 are equal in pairs as indicated. 

Our next goal is to establish Theorem 2. Before doing so it will be helpful to recall the 
relationships between the density S q;ai ,...,a r and the measures fi q]ai ,...,a r and p q - ai ,...,a r - We 
begin by recalling from equation ( |2.5| ) that 

Oq; ai ,...,a r = ' ' ' "/A?;ai,...,a r • (5.o) 

Xl>—>X T 

We remark that if a is a permutation of the indices {1, . . . , r}, then we can express the 
density 5« ?; a <T(1) ,...,a CT(r) in two different ways: we have 



"?;<Vi) >— > a o-to 7 7 ^^^(i)'-' " 



T(r) 

xi>--->a; r 



corresponding to the formula (|5.3|) , but we also have the alternate form 



"?;<V(i),-,a CT (r) — J J d^g-cn,...,^ 

»<7(1)> — >»<7(r) 

since fi q]ai ,...,a r is the limiting distribution of the vector (-E(x; g, Oi), . . . , £?(x; g, o r )), whose co- 
ordinated are ordered by size exactly as the coordinates of the vector (7r(x; g, ai), . . . , 7r(x; g, a r )). 

If we make the change of variables U\ = X\ — x 2 , . . . , w r _i = x r _i — x r , w r = x r and 
integrate out the variable u r , as in Section 2.5, the formula ( |5T3| ) becomes 



<W...,or = 7" "7 d Pg;ai,...,a r - (5-4) 

ui>0, ..., u r -i>0 

For the special permutation a that reverses the set {1, . . . , n}, we see that 

X a (l) > ■ ■ > ^cr(r) ^=^ S r > ■ ■ • > X\ -<=>- U r -\ < 0, . . . , U\ < 0. 

Consequently, we have 

Og;a r ,...,ai = wPg;oi,...,ar (.5.5) 

Ml<0, ...,U,-1<0 

as a companion formula to equation (|5.4|) . 
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As a final prerequisite to the proof of Theorem 2 we recall from equation ( |2.13|) the explicit 
formula 



frq;ai,...,ar{£l>----tr)=exP\i'52c(q,a j )Z j ) J] F [ 



3=1 



\modq 

x/xo 



i=i 



X 



(5.6) 



for the Fourier transform of p q - ai ,...,a r , and the related formula (2.21) 
p qm ,.,„ar(vi,---,Vr~i) =exp f ^2(c(q,aj) - c(q,a j+1 ))rjA 



x 



n f ( 


r-1 


-x( a i-i)H' 


Xmodg V 


J=l 




XT^XO 







X (5-7) 



for the Fourier transform of p q - ai ,...,a r - 

Proof of Theorem 2. Let aj 1 denote the inverse of Oj mod q. We will show that the Fourier 
transforms p q]ai ,...,a r and /L;a~\... a^ 1 are the same function. This is enough to establish part 
(a), since the densities p qm ,...,a r and /i„. a -i a -i will then be identical, which by equation 
will imply 



d q ;ai,...,a r ~ "A%ai,...,a r — ^QA~ 



q;a,i ,...,o^ ' 



x\>X2>--->x r 



xi>X2>--->x r 



We use the formula ( |5.6| ) for /L. ai ar and the analogous formula for //„.„- 1 _-i. Notice 

that the square roots of aj 1 are precisely the inverses mod q of the square roots of aj. In 
particular, c(q, aj 1 ) = c(q, aj), and so the exponential term in the formula (|5.6|) is unchanged 
if we replace each aj by aj 1 . Moreover, we see that for each character \ mod q, 



i=i 



Y, X(aj)£j 

i=i 



i=i 

since the £j are real, so that each term F(-, x) i n ( |5.6[ ) is also unchanged by replacing all of 
the aj with the aj 1 . This shows that fi q - ai ,...,a r = /Va- 1 a" 1 ' which establishes part (a) of 
the theorem. 

We use a similar strategy to prove part (b). Let b be a reduced residue class mod q such 
that c(q, aj) = c(q, baj) for each 1 < j < r. Because of this hypothesis, the exponential term 
in the formula ( |5.6|) is unchanged if we replace each aj by baj as above. Moreover, for each 
character x mod q, 



r 

Yx(baj)^j 
i=i 


= 


r 

X(b) Y, X{a,j)£j 

3=1 



\x 



E x(aj)^ 

3=1 



E X{aj% 

3=1 



{5.1 



so that each term F(-, x) in (|5.6|) is also unchanged by replacing all of the aj with the baj. 
This shows that fM q - au ...,a r — PqMi^-Mn which establishes part (b) of the theorem. 

The proofs of parts (c) and (d) rely on the formula (5.7) for the function p q - ai ,...,a r - When 
the aj are all squares mod q, then the exponential term in (5.7) is identically 1. Moreover, if 
b is a square mod q then each baj is also a square, while if b is a nonsquare mod q then each 
baj is a nonsquare; in either case we have c(q, ba\) = ■ ■ ■ = c(q, ba r ), so that the exponential 
term in the analogous formula to equation (5.7) for p q] b ai ,...,bar i s also identically 1. Since 
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the chain of equalities Q5.8Q again shows that each term F(-,x) is unchanged upon replacing 
the aj with baj, we see that p q - ai ,..., ar = Pq-Mu-Mr and so <W,...,a r = SqMu-Mr by virtue of 
equation ( |5.4| ), which establishes part (c) of the theorem. 

For part (d) we begin with the formula ( |5.5| ) for S q]arj _ >ai . As remarked above, the exponen- 
tial term of p q;ai ,...,a r is identically 1 when the aj are all squares mod q, so that p q - ai ,...,a r will 
be real valued. Since p q - ai ,...,a r is real-valued as well, we conclude that p q - ai ,...,a r is symmetric 
through the origin. Hence making the change of variables Uj i— ► — Uj for each 1 < j < r in 
equation (|5.5|), we obtain 



O q ;a r ,...,ai II U J Pq;ai,...,a r "g;ai,...,a r ; 

ui>0,...,u r -i>0 

which establishes part (d) of the theorem. 

To establish part (e), we first consider the relationship between p q - ai ,...,a r and p q -bai,...,bar 
(note that the residue classes baj have not yet been reversed in the second subscript). Again, 
equation (|5.8|) shows that replacing each aj with baj does not change the terms of the 
form F(-,x), and so we only need to consider the exponential term. Because the quantity 
c(q,a) can only take the two values —1 and c(q, 1), we see that if c(q,a') ^ c(q,a) then 
c(q,a') = c(q, 1) — 1 — c(q,a). It follows that under our hypothesis that c(q,baj) ^ c(q,aj) 
for each 1 < j < r; but we also have 

c(q, ba j+1 ) - c(q, baj) = -(c(g, a j+1 ) - c(q, aj)), 

and so the imaginary expression in the exponential term in equation (|5.6[) is negated upon 



replacing each aj by baj. The end result is that p q -b ai ,...,bar = Pq-,ai,...,a r , which implies that 
when the measure p q] b ai ,...,bar is reflected through the origin, the resulting measure is identical 
to p q - ai ,...,a r - 

Since we can express 

O q ;ba r ,.--,bai Pq;bai,...,ba r 

«i<0, ...,M,-_1<0 

as in equation ( |5.5| ), we can make the change of variables Uj i— >• — Uj for each 1 < j < r — 1 
to see that 



O q ;ba r ,...,bai II P?;tti,-..,a r Oq;a\,...,a r - 

«i>0, ...,u,--i>0 

This establishes the final assertion of the theorem. □ 

6. Remarks, Questions, and Open Problems. 

In this final section, we collect together several observations, unanswered questions, and 
conjectures concerning the results of this paper. 

Systems of inequalities with one equality. Since we know that S q]aj b and S q -b t a are both positive 
(assuming GRH and LI), it follows that each inequality 7r(x; q, a) > 7r(x; q, b) and n(x; q, b) > 
7i(x;q,a) has arbitrarily large solutions, and therefore 7r(x; q, a) = Tr(x;q,b) for infinitely 
many integers x. However, knowing that o~ q - a ,b,c and S q -b, a , c are both positive — i.e., that each 
string of inequalities 

7r(x; q, a) > n(x; q, b) > 7r(x; q, c) and ti(x; q, b) > 7r(x; q, a) > ti(x; q, c) 
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has arbitrarily large solutions — does not imply that there are necessarily any solutions to 
n(x;q,a) = ir(x;q,b) > ir(x;q,c). Undoubtably, the equality n(x;q,a) = ir(x;q,b) should 
hold infinitely often both when their common value exceeds 7i(x; q, c) and when their value 
is exceeded by 7r(x; q, c). We conjecture more generally that for any given integer 1 < j < r 
and reduced residue classes a\, . . . , a r and a'j mod q, the conditions 

7r(x; q,ai) > ■ • • > 7r(x; q, dj) > • • • > 7r(x; q, a r ) 

7r(x;g,o^) 
should be satisfied for infinitely many integers x. 
Multiple equalities. Another direction along these lines involves solutions to 

tc(x; q, a\) = n(x; q, 0,2) = • • • = vr(x; q, a r ) (6.1) 

when r > 3. If we consider the vectors 

^;ai,...,orW = (n(Pn;q,ai) - n(p n ; 9, 02), . . . , 7r(p n ; q, a r _i) --K(p n ;q,a r )), 

(6.2) 

where p n denotes the nth prime, then the sequence of vectors {V^ ;0l) ... )ar (n)} might reasonably 
be expected to resemble a random walk on Z r_1 , where the possible steps at each stage are 
(1, 0, . . . , 0), (— 1, 1, 0, . . . , 0), ... , (0, . . . , 0, — 1, 1), and (0, . . . , 0, —1) and are chosen with 
roughly equal probabilities. (Even though the Chebyshev bias will cause a drift in the mean 
behavior of the vectors Q6.2J ), this drift has the same order of magnitude as the standard 
deviation of the random walk). 

Since random walks on Z n return to any point infinitely often with probability 1 when 
n = 1 or 2 but fail to do so with probability 1 when n > 3 (Polya 0), this heuristic leads to 
the prediction that the system of equalities (|6.1| ) has infinitely many solutions when r < 3 
but only finitely many solutions for r > 4. Similar reasoning suggests that any pair of 
equalities 

ir(x; q, ai) = tt(x; q, a 2 ), 7r(x; q, 0,3) = n(x; q, 04) 

with ai, . . . , »4 distinct should simultaneously hold for arbitrarily large values of x, but 
three or more equalities will hold simultaneously only finitely many times. Further, we 
might expect that the conditions 

7r(x; q, ai) > • • • > 7r(x; q, dj) > ■ ■ ■ > 7r(x; q, a r ) 
7r(x;g,a$) 

ir(x; q, a'-) 

and 

7r(x; q, Oi) > ■ • • > 7r(x; q, ai) > ■ ■ ■ > 7r(x; q, a,j) > ■ ■ ■ > tt(x; q, a r ) 

vr(x; q, <) n(x; q, a'j) 

should hold for infinitely many integers x, but that analogous conditions involving three or 
more equalities would not. 
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Bias factors. To try to quantify the Chebyshev biases for r-tuples of reduced residue classes 
aj mod q for all r > 2, let us define the "bias factor" P q - ai ,...,a r to be the difference between the 
number of nonsquares preceding squares among the Oj and the number of squares preceding 
nonsquares: 

Pq;a u ...,a r = #{« < j : Of ¥" □, CLj = U ) ~ #{* < j '> <H = U, aj ^ D} 

_ y-y- c(q, aj) - c(q, Oj) 

' k<kr C (^ 1 ) + 1 (6.3) 

1 



Y. (2j-r-l)c(q,aj). 



c{q, 1) + 1 1£j£r 

For instance, when r = 2 the possible bias factors are /3 q] N,s — 1> Pq-,N,N — Pq-,S,s = 0> and 
Pq-,s,N = — 1- Rubinstein and Sarnak proved that the sign of <5 g;a) ft — 1/2 equals the sign of 
f3 q;a ,b in this notation, thereby showing that 

Pq;a,b > Pq;a',b' => $q;a,b > $q;a',b'- 



The converse to this statement is false: the first two lines of Table [1.4| shows that 5 qi a,b and 



8q-,a',b' can be different even when P q]a ,b = Pq;a',v, for instance. 

We might hope that the bias factors f3 q]ai ,...,a r would provide some information about the 
relative sizes of the £ g;ai ,...,a r , perhaps in the form of the implication 

flq;ai,...,a T > @q;bi,...,b T =^ fiq;ai,...,a r > 3q;bi,...,b T (6-4) 

for any fixed r. In this regard, it is worth remarking that all of the symmetries in Theorem 
2 are equalities between two r-tuples of residues with equal bias factors. Examining the 
densities computed in Section 4, we observe that the implication (|6.4j) holds most of the 
time, but we do note the following two anomalies: 

• /38;5,1,3,7 = /?8;5,1,7,3 = _1 > ~ 3 = &;1,3,7,5, but ^ appears from Table g^ tna t ^8;1,3,7,5 

slightly exceeds both Ss-,5,i,3j and #855,1,7,3; 



• /3i2;7,i,ii,5 = /3i2;7,i,5,n = — 1 > — 3 = /3i2 ; i,ii,5,7, but it appears from Table [4.14| that 
#i2;i,ii,5,7 slightly exceeds both <5i 2 :7,i,n,5 and <5i 2; 7,i,5,ii- 
It would therefore be of interest, in connection with determining whether the implication ( |6.4| ) 
is always valid, to compute more precisely the densities just mentioned in order to verify the 
apparent inequalities. 

Unfortunately, the computation of the densities to arbitrary precision is not simply a 
matter of reducing e and increasing C and letting a bigger computer run for a longer period 
of time. The major source of error in these computations is the effect of truncating the 
infinite product defining the functions F(z,x) to form the approximations Ft(z,x) ( see 
Section 3.3); to decrease this error it would be necessary to compute zeros of the relevant 
L-functions to a height greater than 10,000, and perhaps to greater precision than twelve 
decimal places as well. 

It is certainly conceivable that some definition of bias factor different from ( |6.3|) might be 
better suited to the role of /3 q]ai ,...,a r , although it is hard to imagine what natural definition 
would be able to explain the apparent anomalies noted above. It might also be the case 
that the implication ( |6.4| ) is valid in more limited settings — for instance, when we restrict 
to r-tuples {01, . . . , a r } and {61, . . . , b r } where exactly half of the aj are nonsquares and half 
squares, and similarly for the bj. 



46 ANDREY FEUERVERGER AND GREG MARTIN UNIVERSITY OF TORONTO 



Convergence to unbiased distribution. Rubinstein and Sarnak proved |10, Theorem 1.5] that 
for a fixed integer r > 2, 



( 



max \r\S q;ait ,.. >ar - 1|) ->• (6.5) 

a\,...,a r *' ' ' J 



as q tends to infinity (where the maximum is taken over all r-tuples of distinct reduced residue 
classes mod q), so that biases of any sort become less and less evident with increasing moduli. 
Thus although the biases in the two-way races mod 8 and mod 12 are more pronounced than 
those in the two-way races mod 4, 5, and 7 owing to the larger values of c(8, 1) = c(12, 1) = 3, 
these sorts of extreme biases will not continue (even with a sequence of moduli such as 
q n = 4p 2 P3 • • -Pn, say, which satisfies c(q n , 1) = 2 n — 1). 

On the other hand, it might happen that an extremely negatively biased density such as 
fiq;Si,...,Sn,Ni,...,N n m ight tend to zero much more rapidly than l/(2ri)\ as n increases, while an 
extremely positively biased density such as 5g ; jv 1) ...,jv n) s 1 ,...,s„ might behave more like l/(n!) 2 . 
In general, one could investigate the uniformity of the statement ( |6.5| ), i.e., attempt to show 
that the statement holds uniformly for all r < r for some integer- valued function r = r (q) 
satisfying 2 < r < 4>(q). For instance, is it the case that 

limsupf max r ! S q . au .. a ) = oo and liminff min r ! S q - ai a ) = 

g^oo \ai,...,Or y ' 0/ q ^oo Vai,...,<Zr *' °/ , s 

if r = r (q) grows sufficiently quickly, and if so, how quickly must r grow with q for these 
phenomena to emerge? We certainly conjecture that 

lim ( max r ! S q . au _._ >a ) =0 (6.7) 

q— >oo \ ai,...,a r() *' ' ' 0/ 

for any arbitrary function r = r (q) tending to infinity with q, but at this point it seems 
nontrivial to prove this modest result even for r = <fi{q) itself. 

Race-game symmetries, isomorphisms, and order equivalences. Another question of interest 
is whether there exist more symmetry results of the type arising in Theorem 2. Reviewing the 
proof of Theorem 2, we see that all of the symmetries therein are consequences of provable 
equalities between two distributions of the type fiq- ai ,...,a r or Pq;ai,...,a r (possibly after reflecting 
one of the distributions through the origin). We can then ask 

(1) whether there exist any equalities between these distributions other than those used in 
the proof of Theorem 2; 

(2) whether there can be numerical "coincidences" between two densities even though their 
underlying distributions are not related. 

An answer to question (1) might be forthcoming from a careful analysis of the Fourier 
transforms p g]ai ,...,a r of the distributions p q]ai ,...,a r - As for question (2), it seems reasonable to 
believe the phenomenon addressed therein can never occur, but proving such a claim seems 
very difficult. 

In support of the possibility that Theorem 2 accounts for all numerical equalities between 
the densities S q - ait ___ }ar , we remark that among the densities computed in Section 4, each time 
a symmetry from Theorem 2 was applicable the corresponding computed densities were equal 
to within a small multiple of the machine precision. Conversely, all such numerical equalities 
observed among the computed densities are accounted for by the symmetries already asserted 
in Theorem 2. 
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Symmetries among individual densities S q . aii _ tar are of course closely related to isomor- 
phisms between complete race games of r-tuples. Theorem 2 implies that the following 
bijections between r-tuples induce isomorphisms of race games: 

• the map r(a,) = aj 1 (mod q) between the r-tuples {a\, . . . , a r } and {% \ . . . , a^ 1 }; 

• the map r(aj) = ba,j (mod q) between the r-tuples {ai, . . . , a r } and {bai, . . . , ba r }, if 
either c(q, aj) = c(q, 1) for each 1 < j < r or c(q, a,) = c(q, bttj) for each 1 < j < r; 

• the map r(aj) = ba r+ i-j (mod q) between the r-tuples {oi, . . . , a r } and {ba r , . . . , bai}, 
if c(q, aj) 7^ c(q, bdj) for each 1 < j < r; 

• either bijection r : {a, b} — > {a', b'}, if c(q, a) = c(q, b) and c(q', a') = c(q', b'); 

• any bijection r : {a, b, c} — > {a', b', c'}, if there exists p ^ 1 (mod q) with p 3 = 1 (mod q) 
such that b = ap (mod q) and c = ap 2 (mod g) and an analogous p' (mod g'). 

(Our definition of isomorphic race games required that the r-tuples consist of reduced residues 
to the same modulus, but the definition has an obvious extension to two r-tuples of residues 
to different moduli which encompasses the last two isomorphisms.) We conjecture that any 
isomorphism between two race games is induced by a composition of bijections from this list; 
in particular, the only isomorphisms between race games of distinct moduli are those race 
games with complete internal symmetry, which were determined by Rubinstein and Sarnak. 
A weaker relationship than isomorphic race games is order-equivalent race games, where 
there exists a bijection r on the set {1, . . . , n) such that 

"9;a CT (i),--a CT (r) > "9; a V(i)v,<V( r) ^ ^ < V;&t( ct (i)),-,*V( -(») > < V;&T(<r'(i))>---,& T ( CT '(r)) 

(6.8) 

for any two permutations a, a 1 of {1, . . . ,n}. Order-equivalent race games seem common 
for small values of r. For instance, any two race games both of the form {iV, S} are order- 
equivalent by Rubinstein and Sarnak's results. The tables in Section 4 indicate many three- 
way race games that are order-equivalent. The triples {iV, N', 1} mod 7 with iVW ^ —1 
mod 7, the triples {iV, N', 1} mod 8, the triples {iV, N', 1} mod 9 with A^A^' ^ -1 mod 9, 
and the triples {iV, N', 1} mod 12 are all order-equivalent to one another. Also, the triples 
{iV, -iV" 1 , S} mod 5, the triples {iV, -iV" 1 , S} mod 7, and the triples {N, -N' 1 , S} mod 9 
are all order-equivalent as well (but note that these are not order-equivalent to the triples 
{iV, N, N} mod 8 and mod 12). 



We remark that from the values in Tables 4.8 and 4.14, the bijection 



r(l) = l, r(3) = ll, r(5) = 7, r(7) = 5 

is quite close to inducing an order-equivalence between the full four-way race games modulo 
8 and 12, respectively (in the sense that the values in these tables would only have to be 
modified by at most 6 x 10 -5 in order for the condition (|6.8| ) to always hold). It would 
certainly be interesting to try to establish (or even classify) order-equivalent race games, 
especially for larger values of r and between r-tuples to different moduli. 

Another problem of Knapowski-Turdn. In their paper 0, Knapowski and Turan pose many 
problems in comparative prime number theory, several of which have been answered by 
Rubinstein and Sarnak []T0[ and herein. We conclude by mentioning one other problem given 



by Knapowski and Turan in B. They ask whether, for any r-tuple a%, . . . ,a r of reduced 
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residue classes mod q, the inequalities 

7r(x;g,ai) < — -, tc{x; q, a 2 ) < — r T , . .., 7i(x; q, a r ) < — - (6.9) 

simultaneously hold for arbitrarily large values of x. Each individual inequality is unbiased 
if cij is a nonsquare mod q and biased negatively if a,j is a square mod q. We remark here 
that if we apply the method of Rubinstein and Sarnak [JlO] to the error term 

logx 

/x~ 



Ei(x;q,a) = —^[ip(q)ir(x;q,a) -li(a:)j, 



which has been centered in a slightly different way than in the definition Q2.1| ) of E(x; q, a), 
we can see that this question of Knapowski and Turan is answered in the affirmative, and in 
fact the set of real numbers x satisfying the inequalities ( |6.9| ) has positive density as well. 
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